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Abstract 

One-dimensional cut-and-project point sets obtained from the square lattice in the 
plane are considered from a unifying point of view and in the perspective of aperiodic 
wavelet constructions. We successively examine their geometrical aspects, combinato- 
rial properties from the point of view of the theory of languages, and self-similarity 
with algebraic scaling factor 9. We explain the relation of the cut-and-project sets to 
non-standard numeration systems based on 9. We finally examine the substitutivity, a 
weakened version of substitution invariance, which provides us with an algorithm for 
symbolic generation of cut-and-project sequences. 
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1 Introduction 

Initially introduced by Y. Meyer [481 149j in the context of Harmonic Analysis and more 
specifically of harmonious sets, the cut-and-project sets or model sets have become during 
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the two last decades a kind of geometrical paradigm in quasicrystalline studies. Qua- 
sicrystals are those alloys whose first sample was discovered in 1982 by Shechtman, Blech, 
Gratias, and Cahn [HJ, namely the alloy AIq^Muqh, characterized by 

i) a diffraction pattern like a dense constellation of more-or-less bright spots, which is 
an indication of a long-range order, 

ii) a spatial organisation of those Bragg peaks obeying five- or ten-fold symmetries, at 
least locally, which indicates a sort of icosahedral organisation in real space with 
five-fold symmetries, 

iii) a spatial organisation of those Bragg peaks obeying specific scale invariance, more 
precisely invariance under dilations by a factor equal to some power of the golden 
mean r = 1+ 2 V ^ and manifestly consistent with the five-fold symmetry since r = 
2cosff. 

One-dimensional examples which are usually presented as toy geometrical models of 
quasicrystals jlU] appertain to the so-called Fibonacci chain family. They are discrete 
quasiperiodic subsets of the real line and are often presented as an illustration of the cut- 
and-project method, mainly developed in this context by [231 Consider a semi-open 
band B obtained by translating the unit square through the square lattice Z 2 along the 
straight line D\ of slope e. D\ is referred to as a "cut" or "parallel" space or "physical 
space". Then project on D\ and along a straight line D2 the lattice points lying in B. 
Note that the latter points belong to a unique path made of horizontal segments (A) and 
vertical segments (B). The resulting sequence of points lying in D\ are the nodes of a 
specific Fibonacci chain if e = - and D2 = D^. Let us denote this set of nodes by T. The 
chain itself is made of the projected paths and reads . . . ABAABABAABAA .... Note 
that a short link B is never adjacent to another B whereas two adjacent long links A can 
occur. Z?2 is called the "internal" space, and BC\D2 is the "window" or "acceptance zone", 
or also "atomic surface". 

The set of Fibonacci nodes is equivalently obtained through a purely algebraic filtering 
procedure. Let us first consider the so-called extension ring of the algebraic integer r: 

Z[r] = {x = m + tit I m, n £ Z} = Z + Zr. 

It can be obtained as the projection onto D\ and along D2 of the whole square lattice Z 2 . 
There exists in this type of a ring an algebraic conjugation, called Galois automorphism, 
and defined by: 

x = m + tit 1 — > x' = m + tit', 

where r' = — ^ = is the other root of the golden mean equation x 2 = x + 1. Then 

define the point set S(O) using an internal sieving rule in the ring Z[r] itself |52j : 

E(fi) = $yx = m + nr G Z[t] x' = m - G ftj =(Z[r]nfi)'. 
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The Fibonacci point set T in the above, with link lengths A = r 2 , B = r, is precisely that 
set E(fi) with = [0,1). 

As was previously mentioned, self-similarity plays a fundamental structural role in the 
existence of quasicrystals. That property is perfectly illustrated by the Fibonacci point 
set T = E[0, 1) since we check from the algebraic definition that r 2 E[0, 1) = E[0, 1/t 2 ) C 
E[0, 1) and so t 2 T C T: this particular Fibonacci point set is self-similar with scaling 
factor equal to r 2 . Immediately we get the infinite nested sequence 

• • • C Tjr 2 ^ 2 C Tj := T '/r 2j C F/r 2j+2 C • • • , (1.1) 

as increasing aperiodic discretizations of R. Since the distance between two adjacent points 
of J-j is equal to l/r 2 - 2 or 1/t 2 -? , it is clear that the inductive limit := lim J-~ densely 

fills the real line. It is precisely this property which led us to examine in recent works 
(see |29[ |1] and references therein) the problem of constructing wavelets by following 
discretization schemes of the real line like Ql-ljl . In [I] , the construction was based on mul- 
tiresolution analysis with spline wavelets earlier elaborated by Lemarie-Rieusset |39j and 
Bernuau |11|I12| (more details will be given below). Our aim was to eventually apply these 
wavelets to the analysis of aperiodic structures, like diffraction spectra of Fibonacci chain 
or some other related spectral problem, and to compare our results with more standard 
wavelet analysis (e.g. dyadic wavelets) jS]. 

Let us recall the main features of wavelet analysis in the framework of the Hilbert 
space L 2 (M). More complete information can be found in comprehensive textbooks, like 
|43j . We shall just outline here the essential of that field whose development since the 
beginning of the eighties amazingly parallels that one of quasicrystals. Under the name 
wavelet is commonly understood a function ip(x) G L 2 (M) such that the family of functions 
ipj k{x) := 2^' 2 ip(2^x — k) for j, k G Z forms an orthonormal (in a restrictive sense) or at 
least a Riesz basis for L 2 (W). A family of vectors (y n ) in a separable Hilbert space V is a 
Riesz basis if and only if each v G V can be expressed uniquely as v = ^ n a n v n and there 
exist positive constants K2 and K2, < K\ < K2, such that 

11 2 

Ki^2\a n \ 2 < \\^2a n v n < K 2 ^2\a n \ 2 

n n n 

for all sequence of scalars a n . We can say that the v n 's are strongly linearly independent 
and, if K\ = 1 = K2, then the basis is orthonormal. A function (or a set of functions) 
generating through dilations and translations an orthonormal basis for L 2 (M) can be found 
using a multiresolution analysis of L 2 (M) (shortly MRA), a method settled by S. Mallat 
44^ and precisely based on an increasing sequence of periodic discretizations of R. The 
genuine MRA ingredients are: 

(i) one scaling function <p(x) £ L 2 (R) such that {ip(x — k)}k& is an orthonormal system, 
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(ii) the Hilbert subspace Vo which is the linear span of {<p(x — k)\k S Z} and which 
corresponds to the "central" element of the sequence of discretizations of R, 

(hi) the increasing sequence of nested Hilbert subspaces • • • Vj-i C Vj C Vj+i ■ ■ ■ which 
are defined by f(x) G Vo 43- f(2 3 x) G Vj and are such that (") ■ Vj = {0} and U - Vj is 
dense in L 2 (R), 

(iv) one wavelet, i.e. a function ip(x) such that {ip{x — k) \ k S Z} spans the orthogonal 
complement Wo of Vo in V\ = Vo © Wo- 

Note that the orthonormality of the basis {<p(x — k)}k^z of the subspace Vo is a strong 
constraint. This condition is usually weakened by just imposing that the system {<p(x — 
k)}kez be a Riesz basis of Vo- 

We thus note that the dilatation factor is genuinely 9 = 2. Indeed, the construction 
of a wavelet basis within the MRA framework relies on the fact that the lattices 2 _J Z are 
increasing for the inclusion. This property is preserved only when 9 is an integer. Then, 
what about choosing another number 9 as a scaling factor? Auscher [H] considered the 
following problem: given a real number 9 > 1, does there exist a finite set {tf>i, V>2> • • • , ipe} 
of functions in L 2 (R) such that the family 9^ 2 ^i(9^x — k), j,k G Z, 1 < i < I, is an 
orthonormal basis for L 2 (M)? Then he proved that a basis of this type exists if 9 is a 
rational number. More precisely, for 9 = p/q > 1, p and q being relatively prime integers, 
there exists a set of p — q wavelet functions satisfying the previous condition. Now, what 
about an irrational scaling factor? We already mentioned the works [111 I12j and [1], the 
latter being mainly devoted to the cases in which 9 is encountered in quasicrystallography, 
like t = (1 + VE)/2 or t 2 = (3 + \/5)/2. Other "quasicrystallographic" numbers have been 
observed: l+v2, 2+v3- All of them belong to the class of quadratic Pisot-Vijayaraghavan 
units. To such numbers are associated discretization sequences like in 

• • • C Aj-i C Aj := A/8 j C A j+1 C • • • , (1.2) 

where A = Ao is a selfsimilar Delone set with scaling factor 9, 9 A C A and is such that the 
inductive limit A^ := lim A,- densely fills the real line. Recall that by Delone set we mean 

j-*oo 

that A is uniformly discrete (the distances between any pair of points in A are greater than 
a fixed r > 0) and relatively dense (there exists R > such that R is covered by intervals 
of length 2R centered at points of A). 

In 1992 Buhmann and Micchelli ^E] proposed a construction of a wavelet spline ba- 
sis corresponding to non-uniform and non-self-similar knot sequences, which are actually 
nested sequences of Delone sets 

• • • c Aj_i c Aj c A j+1 c ■■■ . (1.3) 

They consider only two successive elements, say Ao and A% D Aq for their purpose of 
proving the existence of what they call prewavelets, with minimal support, which span the 
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orthogonal complement Wq of Vq in V\ = Vq © Wq. Here, Vq and Vi are the spaces of 
linear combinations of B -splines on Ao and Ai respectively. They first suppose that the 
"refining" of the "coarse" knot sequence Ao leading to the "finer" Ai consists in adding 
a new knot between each two adjacent knots of Ao- The case of multiple insertions is 
eventually examined. Let us now give some insight on these spline functions, so intimately 
linked to the notion of a Delone set. 

Any Delone set A determines a space of splines of order s, s > 2, in the following way. 

Definition 1.1 Let s > 2. Then Vg S ^(A) is the closed subspace o/L 2 (M) defined by 



An equivalent definition is given in terms of the restriction of functions to intervals 
determined by consecutive elements of A. Suppose A = {A ra | n S Z}, where A n < \ n +\ for 
all n 6 Z. There results from Definition 11.11 that 



Therefore, Vq (A) is the space of splines of order s with nodes in A. Let us now give a 
classical result about the existence of a Riesz basis for Vq^ (A) (HD] • 

Theorem 1.2 For all Delone sets A = {A n | n £ Z} C R and for all s > 2, there exists 
a Riesz basis {-B^ | A € A} of Vq S \a). The function (called B-spline) is the unique 
function in Vq S ^ (A) satisfying the following conditions: 

(i) supp B { x s) = [A, A'], where A' G A. 

(ii) The interval (A, A') contains exactly s — 1 points of A. 



See |6l)| for proof. Note that (i) and (ii) give precise information on the (compact) 
support of B^ whilst (iii) is a normalization condition. 

The construction of B^\x) can be carried out in various ways, by recurrence, by 
using the condition of minimal support, or by inverse Fourier transform. In the latter 
case, one can prove that the Fourier transform of b{ (x + A n ) depends on the s-tuple 
(A n+ i — A n ,...,A n+s — A n ) only. Now suppose that the Delone set A is of finite local 
complexity, which means that, for all R > 0, the point set 




F (s) (A) = {/eC^nL 2 (M)|/| 



polynomial of degree < s — 1 




|J{(A-A)n(-i?,i?)} 



AeA 
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is finite, i.e. local environments of points in A are not different in infinite fashions. Typically, 
such sets A are mathematical models for one-dimensional structures having a long-range 
order, like quasicrystals. We can then assert the following: 



Proposition 1.3 Let A C R k a Delone set of finite local complexity. Then the B- 
splines of order s based on A are of the form B^\x) = <p\(x — A), A G A, where the set 
{4>\{x) | A G A} is a finite set of functions with compact support. 



Therefore, in the finite local complexity case, it is possible to partition the indexing set 
Z for A into a finite set of equivalence classes 0, 1, . . . , q, where the equivalence between 
k and n is given by 



B { ° ] k (x + X k ) = B^ (x + A n ) , for all x . 



Correspondingly, for a given s, the point set A is partitioned into A = U_A n with K n = 

n=0 

{Afc £ A | k £ fi}. The equivalence between k and n means that A& and A n are left-hand 
ends of identical s-letter words if we identify each interval (Xk,\k+i) with a letter of the 
allowed alphabet. To each class n is biunivocally associated the function n (x) = 4>\ k (x) = 
B { °\x + A fc ), k £ fi. In this way, the space V (A) decomposes into the direct sum 



^0 (S) (A) = ©^, 

n=0 



where Vo,n is the closure of the linear span of the functions (f>n(x — A&), k £ fi. 

Let us now go back to the case in which there is self-similarity (like in (|1.2j) ) and 
finite local complexity and let us see which issue holds in term of MRA and existence and 
properties of wavelets. More concretely, let A be a Delone set of finite local complexity 
and self-similar with inflation factor 9 > 1, #A C A. Changing the scale allows us to define 
subspaces V- (A), j £ Z, as 

vj s \A) = {f(x)£L\R) | -£-/(*) = Ya x 6 e - jx ). 



dx s ^-f 

AeA 



Therefore, vj s) (A) is the space of splines of order s with nodes in the jth. scaled version of 
A. 

We now have at our disposal an inductive chain of spaces allowing analysis at any scale. 
More precisely, with the above notations, we have the following statement. 

Proposition 1.4 The sequence of subspaces (vj s \A))j e z is a 9-multiresolution analysis 
ofL 2 (R), i.e. 
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(i) for any j G Z, V^ SJ (A) zs a closed subspace of L (R), 

r«; • • • c id? (A) c y (s) (A) c y/ s) (A) <=••., 

Ujez ^ (S) ( A ) is dense in £ 2 (R), 
H a- 6 z^ } (A) = {0} ; 

(v) f{x) G y/ s) (A) i/arad onh/ if f(Q~^x) G T/ {s) (A), 

(ot,) i/iere exists a finite number of functions 4>n{x) G Vg , called scaling functions such 
that {4>n{x - ^k)}ken,o<n<q is a R^sz basis in Vq S \ 

The proof is straightforward from definitions and Proposition 11.31 
As a consequence of the the above statements, we have the important result obtained 
by Bernuau [TT1IT2]: 

Theorem 1.5 Let A = {A n | n G Z} C R be a Delone set of finite local complexity, self- 
similar with factor 8 > 1. Let us denote the elements of 9~ 1 A by K n = 6~ 1 X n , n G Z. Then 
for all s > 1 f/iere exists a Riesz basis of L 2 (R) of the form: 

" «n), G -1 A, K n+ i ^ A, J G Z j , 
where J ^ s a finite set of compactly supported functions of order C s ~ 2 . 

Such a result offers the possibility of constructing explicit (spline) wavelet basis for a 
large class of self-similar Delone set of finite local complexity, like in particular those ones 
obtained through the cut-and-project method. It was done for some very specific cases in 
(I]. Nevertheless, we think that a systematic study of the properties of (not necessarily 
self-similar) cut-and-project sets is still lacking. 

The aim of the present paper is to fill this gap. Its content is devoted to the study of one- 
dimensional cut-and-project sets built in a rather generic way, without supposing a priori 
any algebraic nature for the respective slopes of D\ and D<i and by using the freedom of 
making the window $7 vary in a continuous way. Hence, our results can be viewed as paving 
the way to further investigations concerning explicit wavelet constructions for arbitrary cut- 
and-project sets. Indeed, what we learn from previous works is the importance of knowing 
in a precise way the environment of each point in the Delone set, i.e. its complexity, in 
order to build the scaling functions and the associated wavelets. Moreover, we should not 
underestimate the structural importance of the nested sequence in five of its features, 
namely 
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(i) the way in which the new points (i.e. the details) intertwine the old ones at each step 
Aj C Aj+i of the increasing sequence (|1.3|) . or, equivalently, the characteristics of the 
detail sets Aj+i \ Aj, 

(ii) the way in which the sequence behaves at the limit j — > oo, i.e. the structure of its 
inductive limit which should be dense in M, 

(iii) the specific advantages brought by the self-similarity hypothesis, 

(iv) related to (i) and (iii), the specific advantages brought by the substitivity hypothesis, 
a notion which is introduced in (|6.2j) . 

(v) the relevance of a specific choice of a nested sequence with regard to the domain of 
application of the corresponding wavelet analysis. 

Note the crucial importance of the first point in connection with the so-called scaling 
or refinement equations which couple with the inclusion Vo C V\ = Vq®Wq. Suppose there 
exist spline bases (v?o,fc)fcgj () cZ' ( < i c 'i > fe)fcex 1 cz ^ or su bspaces Vq and V\ respectively, and a 
wavelet basis (V'o,fc)/ c6 j oC z f° r ^o- Then the following hilbertian decomposition should 
hold: 

<Pl,j( X ) = C ~Ljk(P0,k(x) + ^ d l,jklpO,k(x). (1.4) 

fceXo keJo 
The way the tendency coefficients c\jk and the detail coefficients d± jk behave for large k 
is a crucial question in wavelet analysis, and so the way this question depends on the set 
inclusion Ao C Ai deserves special attention. 

In consequence, we have organized the paper by following the hierarchy of questions 
(i)-(v), and furthermore including in the scheme other aspects of possible interest, like 
some considerations on numeration systems related to cut-and-project scheme. Section 
2 is devoted to the geometrical aspects of those point sets in the line issued from the 
square lattice in the plane through "cut" and "projection": definition, study of distances 
between adjacent points, properties of invariance or covariance under the group SL{2, Z) x 
{—1,1} acting on the square lattice. The material presented there is not specifically new. 
However it represents an original overview, in which one focuses on the universality of many 
features of these cut-and-project sets, independently of specific algebraic or substitutional 
characteristics. 

In Section 3 we examine the combinatorial aspects of the cut-and-project sequences 
from the point of view of the theory of languages: subword complexity, Rauzy graphs, 
and occurrence of specific classes of finite words (or factors) in the bidirectional word 
biunivocally associated to cut-and-project sequences. One can find there original results 
(Propositions I3.8t 13.111 and !3.16|) . The important case of sturmian words is also considered 
and we establish three properties. 13.181 [3.191 13.2(71 describing their factors. 

Self-similar cut-and-project sets are the object of Section 4 in which we solve (Theorem 
14. 1|) the precise relation between self-similarity and algebraic properties of the irrational 
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scaling factor(s) on one hand and the irrational numbers involved in the cut-and-project 
scheme on the other hand. 

In Section 5 we revisit the important notion of /3-integers, i.e. those real numbers which 
do not have "/?- fractional" part when expanded in "basis" > 1, in the light of their 
possible or not relation with cut and projection, and we give a necessary and sufficient 
condition for (3 (Proposition 15. 2 J) under which the positive /^-integers coincide with the 
positive part of a cut-and-project set. 

Another original part of the paper is found in Section 6. It is well known that infinite 
words associated to many cut-and-project sets present the so-called substitution invariance, 
and this property can be crucial for understanding or even for creating the relation Aj — > 
Aj + i in a multiresolution sequence of sets and the scaling equations (|l,4j) issued from the 
companion inclusion Vj C Vj+i- Now, even though the substitution invariance is absent 
for a given bidirectional infinite word u = ■ ■ ■ u-2U-i\uqUiU2 ■ ■ ■ , where Uk belongs to some 
alphabet A, there exist cases in which it could be "hidden" behind the weaker notion of 
substitutivity, which means that there exists another infinite word v = ■ ■ ■ -u^^-il^o^i^ • • • 
over an alphabet B which has substitution invariance and a letter projection tp : B — > A 
such that u = ■ ■ ■ U-2U-i\uqUiU2 ••• = ••• vp(v-2)ip(v-i)\ip(vo)'ip(vi)ip(v2) 

With regards to this property, we shall give in Section 6 an algorithm (Theorem 16.61) 
allowing to "pull back", a given word u pertaining to the algebraic cut-and-project scheme 
to the word v mentioned in the above. In order to illustrate this result, the algorithm is 
carried out on an example of cut-and-project set defined by the algebraic (Sturm) number 
1/V2. 

Eventually, we shall give in the conclusion some hints about possible applications of 
our results, mainly in direction of wavelet constructions, of mathematical diffraction, and 
of design of aperiodic pseudo-random number generators. 

2 Cut-and-project sequences 

In this section we define cut-and-project sequences arising by a projection of a 2-dimensional 
lattice. We also describe their basic properties, including the invariance under certain trans- 
formations. We further show that cut-and-project sequences are geometric representations 
of a three or two interval exchange. Codings of two interval exchanges are in one-to-one 
correspondence with mechanical words Sa^, ~s a ,l3i 

(see definition by (|!H?t|) and lj2TTj) ). 

which are in fact sturmian words (Definition 13.2(1 . 
2.1 Definition and properties 

The construction of a cut-and-project sequence starts with a choice of a 2-dimensional 
lattice L and two straight lines D\, D2. One of the lines plays the role of the space onto 
which the lattice L is projected, the other line determines the direction of the projection. 
If A is an arbitrary non-singular linear map on R 2 , then the cut-and-project sequence 
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constructed using a lattice AL and straight lines ADi, AD2 is the same as the cut-and- 
project sequence constructed using a lattice L and straight lines D±, Z?2- Therefore it is 
not necessary to consider general L and D\ , D2 ■ Some authors allow arbitrary lattice L 
and for D\,D2 take mutually orthogonal straight lines. Others, including us, prefer to fix 
the lattice Z 2 and consider arbitrary straight lines D\, D%. 

Let us take two distinct irrational numbers e, 77 and let us consider straight lines 
D\ : y = ex, D2 : y = rjx. If we choose vectors 

x\ = (l,e) and X2 = (L??) 

e — 77 77 — e 

in the subspaces Di, D2 of IR 2 , then for every lattice point (a, b) G 1? we have 

(a, b) = (b — arj)xi + (b — ae)x2 ■ 

Obviously, the projection of I? on Di along D 2 is the set 

Z[r]]xi , 

where Z[rj] is the abelian group 

Z[rj\ :={a + be\a,b€Z}. 

Similarly, the projection of I? on D2 along D\ is the set Z[e]x2, Since numbers e, 7/ are 
irrational, the mappings (a, 6) 1— > a + 677, (a, 6) 1— > a + 6e are bijections between Z 2 and ^[77], 
resp. Z 2 and Z[e]. Therefore there exists also a bijection 

★ : Z[rj\ Z[e] 

defined by the prescription 

x = a + b-q i ^ = a + 6e , 

which is called the star map. Directly from its definition we obtain 

(x + y)* = + y* for every x,y £ Z[r]] . 

Let us now introduce the definition of cut-and-project sets. It is easy to observe that, 
as a consequence of irrationality of e and 77, the sets Z[e], Z[tj] are dense in R. However, 
if instead of all the lattice Z 2 , we project only those points in Z 2 that belong to a chosen 
strip parallel to D\, the resulting set in D\ has no limit points. The width and position 
of the projected strip is determined by an interval in L>2. Formally, we have the following 
definition. 
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Definition 2.1 Let e, n be distinct irrational numbers and let be a bounded interval. 
The set 

E EjJ? (J2) = { a + brj | a, b G Z, a + be G 0} = {x G Z[r/] | G SI} 
is called a cut-and-project sets, or C&P set. The interval £1 is called the acceptance window 

The above definition is a special case of the very general 'model sets'. Important 
contributions to the study of model sets as mathematical models of quasicrystals are due 

to [31 EU EH EH El EI]- 

Let us mention some of the properties of cut-and-project sequences that follow directly 
from the definition or were derived by cited authors. 

Remark 2.2 

1. Trivially from the definition we have 

s Efl (0!) c s £il) (fi 2 ) for n t cn 2 . 

More generally, if (r2j)j g ^ is a sequence of nested bounded intervals such that 
• • • C C Oj C C • • • and [J Oj = E , 

• • • c s £)r? (r2j_i) c s £jr? (r2j) c s efl (Oj + i) c • • • 

and 

U = %] . 

2. Since Z[r/] and Z[e] are additive groups, the C&P sequence satisfies 

x + £ £)J? (fi) = X £jJ? (f2 + x*) for every x G Z[r/]. 

ITiis property further implies that £ £)J? (f2) is noi invariant under any translation, i.e. 
is aperiodic. 

3. Any model set is Delone, see \5U$ . In our one- dimensional case it implies that there 
exists a sequence (x n )n.ez, and two positive numbers r\,r 2 such that t\ < x n+ \ — x n < 
r 2 , for all n G Z, and T, £tV (0,) = {x n \ n G Z}. 

4- The density of points ofE £)V (Q) ) defined as 

e (E^n)) : = ^ 

is proportional to the length of the interval Q, see 
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5. Since VL is an interval, it is easy to see that there exists a finite set F* C Z[e] such 
that SI — Q C SI + F* . Hence there also exists a finite set F such that 

E ei „(fi)-E e ,„(fi)cE e| „(n) + F. 

Thus £ £jr; (S7) satisfies the so-called Meyer property. In fact, every model set A C W 1 
is a Meyer set, i. e. is Delone and satisfies A — A C A + F for a finite set F, cf. \5fl\l . 
Note that in this sense a cut-and-project set is a generalization of a lattice, because 
a lattice satisfies the above property with F = {0}. 

6. Since E e „(S7) is a Meyer set, it is of finite local complexity, i.e. it has only a finite 
number of local configurations of a fixed size More precisely, for g > we define 
the Q-neighbourhood of a point x £ E £)J? (fi) as 

N e (x) ={ye £ B ,,,(«) | \x - y\ < g} . 

The family of g-neighbourhoods {N e (x) — x \ x £ £ £)r? (S7)} is finite for any positive 
g. In particular, there is only finitely many distances between adjacent points of any 
C&P sequence, i.e. the set {x n+ \ — x n \ n S Z} is finite. 

7. The boundary of the acceptance interval f2 influences the structure of the C&P se- 
quence only trivially. The sets S £jr? [c, c+l), £ £jJ? [c, c+l], S £)r? (c, c+t), and S £)r? (c, c+£] 
differ at most in two points. Ifc,c + £$L Z[e], then all these sets coincide. 

8. If the acceptance window Vt is chosen to be a semi-closed interval, then the number 
and shape of g-neighbourhoods of a C&P sequence does not depend on the position 
fl, but only on its length 

9. The setY> £ ^ rj {Q), where SI = [c,c+£) or£l = (c,c+£] contains every finite configuration 
infinitely many times. More precisely, for every g > and every x G E £j}? (r2) there 
exists infinitely many points y G £ £)I/ (il), such that 

N g (x) -x = N e (y) - y . 

We say that such C&P sequences are repetitive. 

2.2 Distances 

As it was mentioned in (6) of Remark 12.21 any cut-and-project sequence has only a finite 
number of distances between adjacent points. It turns out that the number of distances 
does not exceed 3. We quote the result of [SO], which is a generalization of the famous 3- 
distance theorem, and provide algorithms for determining the distances for any particular 
acceptance interval. 
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Theorem 2.3 Let Q be a semi-closed interval. For every T, £>TI (Q) there exist positive 
numbers Ai, A2 € Z[rj\ smc/i i/iaf f/ie distances between adjacent points in E e)J (fi) take 
values in {Ai, A2, Ai + A2}. The numbers A±, A2 depend only on the parameters e,r) 
and on the length of the interval $7. They are linearly independent over Q and satisfy 
> 0, A^ < 0, and A^ - A^ > 

Di/ y = ex 




D\ :- 



Ai Ai+A2A!+A 2 Ai Ai+A 2 Ai A 2 Ai Ai+A 2 Ai A 2 Ai Ai+A 2 Ai Ai+A 2 



A B BAB 



A C A\ B AC A 




BAB 



Figure 1: Construction of a cut-and-project sequence and assignment of the infinite word 
which codes the order of distances in the cut-and-project sequence. 



More precisely, every C&P sequence T, £>TI (Q) = {x n \ n G Z} has always two or three 
type of distances between adjacent points, namely 

f {A 1 ,A 2 ,A 1 + A 2 } ifA*-A*>|Q|, 
{x n +i - x n \ n £ Z} = < (2.5) 

I {Ai,A 2 } ifA*-A* = |0|. 
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Therefore one can naturally assign to it a binary or ternary bidirectional infinite word 
u EiTj (fl) = (M„) ng z, for example in the alphabet {A, B,C}, by 



.4 
B 

C 



if x n+ i 
if x n+1 
if x n+ i 



x r 

X Tl 
X i. 



Ai, 

Ai + A 2 , 
A 2 . 



(2.6) 



An example of construction of a cut-and-project sequence together with the assignment 
of the infinite word is shown in Figure ^ 

The successor of a point x in the C&P sequence is determined using its star-map image 
x*. If O = [c, c + £), then the inequality A^ — Ag > £ from Theorem 12.51 ensures that the 
nearest right neighbour of the point x in 2^(0) is equal to x + Ai if x* G [c, c + i — A\), 
to x + A 2 if x* G [c- A£,c+£), or to x + Ai + A 2 if G [c + £- A}, c- A£). Thus we can 
define a piecewise linear map f : [c, c + £) ^ [c, c + £) which satisfies f(x*) = y* if y is the 
nearest right neighbour of x. This mapping plays an important role in our considerations. 

Definition 2.4 Let c G M, I > 0. T/ie stepping function of the interval [c,c + I) is a 
mapping f : [c, c + — > [c, c + defined by 



f(y) = 



if ye [c,c + £— A*) , 
y + A^ + A* i/ y G [c + £ — A*, c — A£) , 
i/ y G [c — A 2 , c + . 



y + A^ 
y + A} 
y + A^ 



The graph of the map / is illustrated on Figure EJ 

The stepping function / of Figure |2] has been studied in the field of dynamical systems 
under the name of three interval exchange (in case that / has two discontinuity points) or 
two interval exchange (if / has only one discontinuity point). In the former situation, the 
two discontinuity points divide the acceptance interval into three disjoint intervals, say 
£1a, £Ib, ^c> from left to right. The image /(O) = f2 is again divided into three disjoint 
intervals f(&c)> K^b), /(^a) 5 in the order from left to right. Therefore one sometimes 
uses the graphical notation shown in the following scheme. 




m c )f(n B ) f(n A ) 
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c+A* 





c + l+Al 



c 



c-A\ c+i 



y 



Figure 2: Stepping function / associated to E e)7J [c, c + £). If the distances between neigh- 
bours in the C&P set £ £j?? [c,c + £) take only two values Ai, A2, i.e. £ = A* — Ag, then the 
discontinuity points c + £ — A* and c — A£ coincide. 



The relation of three interval exchange to simultaneous approximation of a pair of 
irrational numbers is treated in [T| I26| IBTj. In the theory of symbolic dynamical systems 
one studies the orbit of a point under the mapping /, i.e. the sequence (f^)nefio 1 - 

Therefore C&P sequences can be viewed as geometric representations of three interval 
exchange transformations. 

Changing continuously the length £ of the acceptance interval O = [c,c + £) causes 
discrete changes of the triplet of distances (Ai, A2, Ai + A2). Recall that the triplet does 
not depend on c, therefore we consider c = 0. Let T, £tTI [0,£) be a C&P sequence with three 
distances between its neighbours, and let A^ > 0, Ag < 0, A^ + Ag, be the star map images 
of these distances. According to (|2.5j) . we must have £ < A^ — A£. Growing £ up to the 
value A* — Ag causes appearance of new points in the C&P sequence, which split the large 
distance Ai + A2 into two distances Ai and A2. When £ reaches the value A* — A£, the 
large distance Ai + A2 disappears completely. 

On the other hand, diminishing the length £ of the acceptance interval causes that the 
frequency of the distance Ai + A2 grows to the detriment of occurrences of the distances 
Ai and A2. This happens until £ reaches a certain limit value for which one of the distances 
Ai or A2 disappears. 

Starting from a given initial value £q, for which the set Y, e ^[0,£o) has two distances 
between adjacent points, we can determine by recurrence the increasing sequence of lengths 
£ n , n £ Z, of the acceptance windows for which T, e ^[0,£ n ) has only two distances. The 
initial value £q is determined below (Remark I2.6|) . 

^ote that we use the notation N = {1,2,3, ... } and No = {0} U N. 
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Let A* x > and A* 2 < be the star images of distances occurring in the sequence 
E EjV [0,£ n ), i.e., according to 4 = A*j - A* 2 . 



If A*! + A* 2 > then 

£n-l '■= ^nl) ^(n-l)l := ^nl + ^n2> ^(n-l)2 := ^n2- 

If A*j + A^ 2 < then 

^n-l := — A* 2 , ^(n.-l)l := A nl , ^(n-l)2 := ^nl + ^n2- 



(2.7) 



Similarly, the algorithm which determines the triple £ n +i, A( n +i)li ^(n+i)2 from the triple 
£ n , A n i, A n 2 has the inverse form 



If A ni > A n2 then 

n . A* OA* A* • A* A* A* ■ A* 

-t-n+1 •— £± n i z ^n2' ^(n+l)l '~ nl ^n2> ^(n+l)2 ' — LA n2- 

If Ani < A n2 then 

£n+i '■= 2A* X — A* 2 , A* n+1 - )1 := A nl , A^ n+1 ^ 2 := A* 2 — A nl . 



(2i 



From this algorithm it can be seen that the C&P sequences S £jJ? [0, £ n ) and E £)V [0, 
have exactly one type of distances in common. It is the shorter one among A( n _ 1 ) 1 , A( n _]_) 2 . 
Moreover, the distances in E £)r/ [0,£), for £ n _i < £ < £ n , are three, and they are given by 
the union of the sets of distances for H, £ ^[0,£ n ) and S e>7? [0,£ n _i). 

2.3 Transformations 

We would like to identify those parameters e,rj,fl which provide essentially the same cut- 
and-project sequences. For example, we have 

a + brj + Z £tV (Q) = £ £jJ7 (n + a + be) , for o,6gZ. (2.9) 

Such translation of the C&P sequence corresponds to a translation of the lattice Z 2 . The 
group of all linear transformations of the lattice Z 2 onto itself is 

G = {Ae M 2 (Z) | det A = ±1} . 
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Consider the matrix A = f° For arbitrary irrational numbers e, 77 and arbitrary interval 
ft it holds that 

£ £|77 (ft) = {p + qn \ p,q £Z, p + qe = 

= {(hv){ P q ) | p,g€Z,(l,e)(?)€n} = 

= {(l,r?)A(£) | (M)A(?) en} = 

= {(a + c?7,5 + d7?)(^) | p,q £Z, {a + ce,b + de)( p q ) £ ft} = 

= + |p,<f€Z,(l,|jg)(S)€ 5 fen} = 



= (a + cr/)!!^ fc+dr, 




Let us study the consequences of the above relation if we choose for the matrix A one of 
the three generators Ai = ( ) , A 2 = (q _°) , A3 = Q ) of the group G, 

z £tV (n) = S 1+E) 1+r? (ft) , (2.10) 
£ £ir ,(ft) = s_ e ,_,(-n), (2.11) 
£ £ir ,(ft) = r? £11 (±ft). (2.12) 

E't] 

The mentioned transformations were used in [3U] for the proof of the following theorem. 

Theorem 2.5 For every irrational numbers e, rj, e 7^ 77 and every bounded interval ft, 
there exist e G (—1,0), 77 > and an interval ft, satisfying max(l + e, —i) < |ft| < 1, suc/i 

£ e7 j(ft) = s£ £i ^(ft) , /or some s£l. 

Moreover, if |ft| 7^ 1, i/ien i/ie distances between adjacent points in X<?^(ft) are 77, 1 + 77, 
and 1 + 2fj. The distances take only two values rj, 1 + fj, only if |ft| = 1. 

According to the above theorem, every C&P sequence is geometrically similar to an- 
other C&P sequence whose parameters satisfy certain restricted conditions. In particular, 
without loss of generality we can consider e G (—1,0), n > and the length of the accep- 
tance interval ft in the range (max(l + e, — e), 1]. If moreover we are interested only in 
the ordering of the distances in the C&P sequence and not on their actual lengths, i.e. we 
consider only the infinite word u £jT? (ft) we can choose any fixed r\ > 0. The words u £j7?1 (ft), 
u £/(y2 (ft) for 771 / r\2 coincide. Therefore choosing 77 = which corresponds to a cut-and- 
project scheme with orthogonal projection, causes no loss of generality when studying only 
combinatorial properties of C&P sequences. The choice of 77 however influences geometry 
of the sequences, such as existence of self-similarity factor, etc. (cf. Section 
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Remark 2.6 Note that according to Theorem \2.h\ the length |f2| of the interval being 
equal to 1 is the only case among (max(l + e, — e), l] for which the C&P set has only two 
distances between neighbours. We shall thus take it as the initial case for the algorithm 
given in (|2T7|) . (JTHJ). We have 

4 = 1, A^ = l + e, A* 02 = e. (2.13) 

Example 2.7 As an example, let us study the case |f2| = 1, which gives a C&P sequence 
with two distances between adjacent points. Set a = —e £ (0,1) and put fi = {(3 — 1,(5] 
for some (3 £ R as the acceptance window. Since the condition a + be £ £1 rewrites as 
(3 — 1 < a — ba < (3, we obtain a = [ba + (3\ and the C&P sequence is of the form 

E_ a>IJ (0 - 1,(3] ={[ba + (3\ + bn\beZ}. (2.14) 

Since a, rj > 0, the sequence x n := |_^«+/?J +nr] is strictly increasing and thus the distances 
between adjacent points of the C&P set Y>- a ^ v ((3 — 1,(3] are of the form 

x n+l - x n = n + |_(n + 1)q + (3\ - [na + (3\ = j V + 1 ' (2.15) 

From this expression it can be seen that the distances in the C&P sequence are arranged 
in the same order as 0's and l's in the so-called lower and upper mechanical word. Recall 
that the lower mechanical word s a ^ : Z — > {0, 1} is defined by the prescription 

s a ,p(n) = L(n + l)a + (3\ - [na + (3\ , (2.16) 

where a is called the slope and (3 the intercept of the word s a g. Similarly, upper mechanical 
word s a) 3 : Z — > {0, 1} is defined by the prescription 

s a A n ) = \( n + !)« + P\ ~ \ na + £1 • ( 2 - 17 ) 

The infinite word u £ ^{(3 — 1,(3] with parameters n > 0, e = —a is in fact the lower 
mechanical word s a «. Similarly, the choice [(3,(3 + 1) for the acceptance window provides 
the upper mechanical word ~s a ,/3 ■ The mechanical words are in fact related to the well-known 
sturmian words, see Definition s. and Remark \S. M 

3 Combinatorial properties of C&P sequences 

Ordering of the distances in the C&P sequence S £j „(J2) on the real line defines naturally 
an infinite binary or ternary word u £jr? (f2) (cf. equation (|2,6|0 . In this section we describe 
some combinatorial properties of these infinite words. Some of the results derived here can 
be found in 27 . Nevertheless, the geometric approach to three interval exchange makes 
the proof simpler. 
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Obviously, geometrically similar C&P sequences correspond to the same infinite words. 
Therefore according to Theorem 12.51 we can consider only 



£ G (—1,0), 7/ > and Q = [c,c + £), (3.18) 
where max(l + e, — e) < £ < 1. 

In this case the stepping function has the form 

y + 1 + e if y G [c, c + £ - 1 - e) =: VL A , 
f(y) = \ y + l + 2e if y G [c + £ - 1 - e,c - e) =: Sl B , (3.19) 
y + e if y G [c — e, c + ^) =: • 

For simplicity, we denote the discontinuity points of the stepping function 

Si := c + £ — 1 — e , #2 := c — e . 

As was already mentioned, the infinite word u £> „(0) is defined over a binary alphabet if 
and only if the length of the acceptance window is £ = 1, because in that case Si = S2 and 
thus £1b = 0- Otherwise the alphabet of u £j „(JT) has three letters. 

Let us recall some basic notions of combinatorics on words. An alphabet A is a finite 
set of symbols - letters. A finite concatenation w of letters is called a finite word. The set 
of all finite words (including the empty word e) over the alphabet A is denoted by .4*. The 
concatenation of n letters a is denoted by a n . The length of a word w is the number of 
letters concatenated in w, it is denoted by \w\. One considers also one-directional infinite 
words 

U = UQU1U2U2, ■ ■ ■ 



and bidirectional infinite words 



U = ■ ■ ■ U-2U-IUQU1U2 ■ ■ ■ ■ 

In relation to C&P sequences, mainly bidirectional infinite words are important. We denote 
the set of such words by A z . A word w = wqWi ■ ■ ■ Wk-i is called a factor of a word u G A 1 ' 
if w = uiUi + i ■ ■ ■ iii + k-i for some i. Note that such i is called the occurrence of w in u. The 
set of factors of a word u G A z with the length n is denoted by 

C n = {uiu i+ i ■ ■ ■ u i+n -i I i G Z} . 

The set of all factors of the word u (the language of u) is denoted by 

C = |J C n . 

The number of different n-tuples that appear in the infinite word is given by the so-called 
complexity function, see for example 
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Definition 3.1 The complexity of a word u G A z is a mapping C : N — > N such that 

C(n) = #{uiu i+ i ■ ■ ■ u i+n -i | i G Z} = #C n . 
Obviously, if u is an infinite word over a /c-letter alphabet, then its complexity satisfies 

1 < C(n) < k n , for every n G N . 



It is known [53] that if there exists an n G N such that C(n) < n, then the word u is 
periodic, i.e. of the form u = ■ ■ ■ wwww ■ ■ ■ for a finite word w. An aperiodic word of 
minimal complexity thus satisfies C{n) = n + 1 for all n G N. An example of such a 
word is the word • • • 0001000 • • • on the alphabet {0, 1}. The structure of such words is 
little interesting, since the occurrence of the letter 1 is singular. Obviously, they cannot 
be obtained by a cut-and-project scheme. In order to avoid such strange phenomena, we 
consider only those words which have reasonable density of their letters. The density of a 
letter a in the infinite word u = ■ ■ ■ U-2U-i u oUiU2 • • • is defined by 

#{i eZn [—k, k] I Ui = a] 
6a := , lim ' 

if the limit exists. 

Definition 3.2 An infinite word u G A z is called sturmian if C(n) = n + 1 for all n G N 

and the densities of its letters are irrational. 

Such words have been extensively studied. We shall focus on them later in this section. 
Let us mention that the condition C(n) = n + 1 for all n G N in the case of one-directional 
infinite words already implies irrationality of the densities of letters. Our notion of sturmian 
words follows [35; however, sturmian words are often considered only as one-directional. 
A survey of properties of one-directional sturmian words can be found in (41 j . 

3.1 Complexity 

For the determination of the factors in the infinite bidirectional word u £ ^{Q) associated 
with the C&P sequence X £i7; (f2) it is essential to study the stepping function /. Its prop- 
erties imply that the word w = wqWi ■ ■ ■ w^-i in the alphabet A = {A, B,C} is a factor of 
Ue,T)(Q) if an d only if there exists an x G T, eiV (Q) such that 

x G j •/"(■£ ) G ^iwi i ■ ■ ■ i f (•£ ) G Q Wk _ 1 , 
where Qa, ^b, are defined in (|3.19|) . This means that 

w = wqw\ ■ ■ ■ Wk—i g Ck ^ o Wo n/- 1 (^ 1 )n---n/- (/c - 1) (^ fc „ 1 ) ^0. 
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In case that w = wowi ■ ■ ■ w k -i G £k, we denote 

n w := Q wo n r\n Wl ) n ■ • ■ n /"^(n^i) . 

Properties of the stepping function / imply that £l w is an interval, closed from the left, 
open from the right. Obviously, we have 

weCk 

where the union is disjoint. In order that points x*, y* G f2 belong to different intervals x* G 
^io(i)) 2/* G ^iu(2)j where / ti/ 2 ), ujW,it/ 2 ) G C k , there must exist i = 0, 1, . . . , A; — 1 
such that at least one discontinuity point of the function / lies between /*(a?*) and 
Thus boundaries between intervals Q w for w £ C k are all points z such that is a 

discontinuity point of the function /, i.e. f l {z) G {^i,^}. This implies that the number 
of different factors of the word u £ ^(Q) of length k is equal to the number of elements 

#C k = #{c, 6 U r 1 ^), . . . , /- fe+1 (<5i), &, r 1 ^), • • • , /" k+ \S 2 )} . (3.20) 

For determination of the cardinality of the set C k , i.e. complexity of the infinite word, 
we need to use two properties of the stepping function /, which follow from the irrationality 
of e. 

1) Let xefl. Then f(x) / x for all % G Z, j ^ 0. 

(3.21) 

2) Let x,y eQ. Then 3 i G Z, such that f l (x) = y iff x - y G Z[ej. 

Since c = /(Sz), the property 1) implies that c, #2, f^ 1 ^), ■ ■ ■ , f~ k+l {^2) are distinct. 
Thus 

C(k) = #C k >k + l, for all k G N . 

Equality holds only in the case that Si = 62- Therefore for C&P sequences u £jV (Q) with 
parameters e, 77, satisfying (|3.18j) it holds that 

C{k) = k + l <^ \n\ = i. 

We have thus derived the following well known fact. 

Remark 3.3 Every mechanical word (|2.16l) or Q2.17JI is a sturmian word. The opposite 
is also true J2U 

The results about the complexity function of all C&P sequences are summarized in the 
following theorem. 
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Theorem 3.4 ( [30J ) Let C be the complexity function of the infinite word u Er ,(Q) with 
f2 = [c,c + £), and let f be the corresponding stepping function. 

• Ifii Z[e], then 

C(n) =2n+l, for n E N. 

• If £ £ Z[e], then there exists a unique no £ No swc/i £/iai 




2n + 1 for n < uq , 
n + no + 1 /or n > no • 



Obviously, generic cut-and-project sequences have complexity 2n + 1. In case that the 
length of the acceptance window is in Z[e], the cut-and-project sequence has a specific 
property which is explained in the following remark. 

Remark 3.5 Theorem \3.J\ says that infinite words u £j7? (fi) with |0| £ Z[e] /mi>e complex- 
ity C(n) = n + const, for sufficiently large n. One- directional words with such complexity 
are called quasisturmian by Cassaigne in This author shows that such words have a 

sturmian structure, i. e. up to a finite prefix they are images under a morphism of a one- 
direction sturmian word. Following the same ideas, one can show that the bidirectional 
infinite word u £ ^(Q) with £ Z[e] corresponding to a C&P sequence satisfies the fol- 
lowing: there exists a sturmian word v = ■ ■ ■ v—2V—\\vqv\V2 • ■ ■ £ {0, 1} Z and finite words 
W Q ,Wi £ {A,B,C}* such that 

(«) = ••■ W V _ 2 W V _ X \W V0 W V1 W V2 ■■■ , 

i.e. it £)r? (f2) can be obtained by concatenation of words Wo,W% in the order of 0's and l's 
in the sturmian word v. 

Example 3.6 Consider e = — - and rj = t, where r = |(1 + (see Introduction). 

For the acceptance window choose Q\ = [— - + 4, — — + ll), $^2 = ^i + 2 — -. Since the 
length of the acceptance windows I = |Oi | = | O2 1 satisfies I = 7 — £ (max(l + s, e), l] , 
according to Theorem \2. 51 the distances between adjacent points in the C&P sequences 
£ £i?? (Qi), E £i7? (f22) are 1 + r, coded by the letter A; 1 + 2t, coded by the letter B; and r , 
coded by the letter C. 

Using (|2.9|) the sequences Ee^fii), S £i?? (Q2) are the same, up to a shift by 2 + 3r. Both 
of them contain 0, since £ fii, £ ^2- Figured shows a segment of the infinite words 
coding these sequences, where we mark the point in both of them. 

It can be shown that the word M £jJ? (f2i) can be obtained from the upper mechanical 
sequence s 1 _ 1 by substituting for the word wq = BACAACA and for 1 the word 

w\ = BAB AC A. Similarly, the infinite word u £;T? (ri 2 ) can be obtained from the upper 
mechanical sequence si _i by substituting for the word wq = BAG ABA and for 1 the 
word wi = CAACA. 
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I " 

BA CAA CA BA CA BA BA CA BA CAA CA BA CA BA CAA CABACA- ■■ 

\ 1 1 1 

G Z £tri (Q 2 ) 



Figure 3: Block structure of quasisturmian words. 
3.2 Properties of the language 

Let us study the language of the infinite word u £)r? (f2). Since the stepping function /n 
corresponding to the C&P sequence £ £)r/ (f2) and the stepping function fn+z corresponding 
to the C&P sequence T, e)V (Q + z) satisfy the relation 

fn+z{x) = z + f n (x - z) , 

the language C and the complexity C of a C&P sequence depend only on the length of the 
acceptance interval and not on its position. 

First we determine the density of a given factor. Recall that the density of a particular 
factor w in the infinite bidirectional word u = ■ ■ ■ u_2U-\UqU\U2 • • • is defined by 

#{i e Zn[-k,k] I UiU i+1 . . .Ui +n _i = w\ 

^ := fc ^ 2kTi ' (3 ' 22) 

if the limit exists. Elements x G S £j ^(J7) such that the word corresponding to the fc-tuple 
of the right neighbours of x is w = wowi ■ ■ -Wk~i G Ck satisfy x* G £l w . Therefore the 
occurrences of the factor w in the infinite word u £ ^{ti) are given by the set 

{x G Z[rj\ | x* G fl w } . 

Since £l w is a semi-closed interval £l w C $7, it is a C&P set, and its density is proportional 
to the length of the acceptance window (see fact 4 of Remark 12. 2p . This implies that the 
density of a factor w G C in u e<v (Q) is given by 

I I 

eu = W 

Another important property of the language of the infinite word u e ^{£l) is given by the 
following proposition. 

Proposition 3.7 The language C of the infinite word u £t „[c,c + €) is stable under mirror 
image, i.e. 

W = WqWi ■ ■ ■ W n ^i G C ==>- W = W n -iW n -2 ■ ■ ■ w G C . 

Moreover, the densities of the factors w and w coincide, g w = Qw- 
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Proof Since the language of the infinite word u EtV [c, c + £) depends only on the length £ 
of the acceptance interval and not on its position, it suffices to show the statement for the 
infinite word u £ ^[— |, |), which codes the C&P sequence E £)I/ [— |, |). If — | ^ Z[e], then 

y \-L L\-y (-L L\ 

^£,11 2> 2> ~ ^£,V\ 2' 2' 

and thus it is a centrally symmetric set and the statement of the proposition is obvious. If 
— | = a + be £ Z[e], then for the proof it suffices to realize that the central symmetry of 
the set Yi £tV [— |, |) is broken by a unique point, namely a + brj. Since every factor w G C 
occurs in u EyV [— |, |) infinitely many times, we can still use the same argument to justify 
the proposition. ■ 

For sturmian words the above property is well known, its proof can be found in |41j . 

For different lengths £±,£2 £ (max(— e, 1 + the languages of the infinite words 

u E>rj [c, c + £±) , u £iTI [c, c + £2) are different. However, if we are interested only in factors 
of a given length n, the sets C n can coincide even for infinite words corresponding to 
acceptance intervals of different lengths. Let u e ^($l) be an infinite word with the length 
of the acceptance window = £. We denote C n {£) its set of factors of length n. For 
example C\{£) is equal to the alphabet {A, B, C} for every length max(— e, 1 + e) < £ < 1. 
Let us now see how much we can change the length £ of the acceptance interval f2 without 
changing the set C n (£). 

Proposition 3.8 Let n G N be fixed. Denote by the complexity function of the infinite 
word u £tTI (£l) with = [c,c + £). Define 

V n = {£ I max(-e, 1 + e) < £ < 1, C e (n) < 2n + l} . 

Then elements ofT> n divide the interval (max(— e, 1 +e),l] into a finite disjoint union of 
sub-intervals, such that C n {£) is constant on the interior of each of these intervals. 

The proof for special case e = — -, r\ = r, can be found in [17]. The demonstration of 
the general statement follows analogous ideas. 

Example 3.9 Consider again the parameters e = r/ = r . For the sake of illustra- 
tion of the previous proposition, let us choose n = 4 and find the division of the interval 
(max(— e, 1 + e), l] = (^, 1] into intervals such that the set Ca{£) is constant on the interior 
of these intervals. For that, we need to find £ so thatCgiji) < 2n + l forn = 4. Using (|3.2U|) 
this happens if 

f^(5 l ) = 5 2 or f {k) (5 2 ) = 5 1 , for some k = 0,1,2,3 . 

For solving these equations, one has to realize that not only the discontinuity points 5\ = 
c + £ — 1 — e, 62 = c — e depend on £, but also the prescription for the function f depends 
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on it. However, since every iteration of f is piecewise linear, the above equations can be 
easily solved. We find that 

V A = {4 - 2r, -4 + 3r, 1} . 

The division of the interval (-, 1] by the elements ofD^ is illustrated in Figure^ The figure 
also shows the set of factors of length 4 for each of the subintervals and for the singular 
lengths f G D4. Note that the set of factors corresponding to the interior of a subinterval 
is a union of sets of factors corresponding to the boundary points of the subinterval, for 
example 

C 4 (£) = £4(4 - 2r) U £ 4 (-4 + 3r) , for all £ € (4- 2r, -4 + 3r) . 



4 - 2r -4 + 3r 1 
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Figure 4: The appearance of factors of length 4 in E_i (— f2) in function of the length of 

t ' 

the acceptance window. 



3.3 Special factors 

Let us introduce some important notions which help us understand the structure of fac- 
tors in the language C The notions have been introduced in |20j . Consider arbitrary 
bidirectional infinite word v in an alphabet A, 

For every factor w £ C of v there exists at least one letter a £ A such that aw £ C. Such 
letter a is called a left extension of the factor w. The set of left extensions of the factor w 
is denoted by Lext(iu) C A. 

Remark 3.10 If w is a prefix of the factor w, then 

Lext(u>) 3 Lext(u)) . 
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Since for every factor w G C n +i we have w = aw for some w G C n and a letter 
a G Lext(w), the increment of the complexity function can be computed as 

AC(n) = C{n + 1) - C(n) = #C n+l - #£„ = (#Lext(u;) - l) . (3.23) 

Similarly one can define the notion of right extension of a factor and obtain analogical 
relation 

AC(n) = (#Rext(u>) - l) . (3.24) 

W£jC„ 

Obviously, for determining the increment of complexity, only such factors w are interesting 
that have #Lext(u>) > 2 or #Rext(w) > 2. Such factors are called left (resp. right) special 
factor. 

Let us study these notions for infinite words corresponding to C&P sequences. Propo- 
sition 13.71 implies 

w is a left special factor of u e ^{0) 
t 

w is a right special factor of u e>7? (0) . 

Therefore we can limit our considerations to the study of left special factors. Theorem 13.41 
implies that 

1 < AC(n) < 2. 

Thus for every n G N there exists at least one and at most two left special factors of length 
n. Let us explain how one can decide whether a given factor w G C n is a left special factor 
or not. Recall from the beginning of Section 13.11 that every factor w G C n is linked with 
an interval £l w C f2 of the form £l w = [a, b), where 

a, b g { C , s 1 j-\s 1 ), . . . , r fc+1 (5i), 5 2 j- 1 (5 2 ), . . . , r k+1 (h)} , 

and 5\, 5 2 are the discontinuity points of the stepping function /. 

For every x G E £jJ7 (J7) such that x* G Q w the n-tuple of distances in the right neigh- 
bourhood of the point x corresponds to the word w. The nearest left neighbour of the 
point x is determined by /~ 1 (x*). In order that the word w is a left special factor, the 
interior must contain at least one discontinuity point of / . If the discontinuity point 
of / _1 lies only on the boundary of il w , then w has only one left extension and thus is not 
a left special factor. The discontinuity points of the function f~ 1 are 

c + £ + e = f(5 1 ) and c + 1 + e = /(c) = f(5 2 ) . 

Properties Q3.21JI of the stepping function imply that if £ ^ Z[e], then the discontinuity 
points of / _1 do not belong to the set 

{c s lt r\h), r n+1 (^i), s 2 , r\s 2 ), r" +1 (5 2 )} 
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for any n E N, and, therefore, if a discontinuity point of / lies in Q w , then it lies in its 
interior. We can therefore conclude with the following proposition, which is proved in a 
different way in |27| . 

Proposition 3.11 Let £ £ Z[e]. Consider the one- directional infinite word = u^u^u^ 
i = 1,2, coding the orbits {f n {c + I + e) \ n E N } and {f n (c + 1 + e) \ n E N }. Then a 
finite word w is a left special factor of u £tV (£l) if and only if it is a prefix of or . 

Remark 3.12 Since c + £ + e is the image of Si, which is on the boundary between in- 
tervals £Ib,^c, then every prefix of the infinite word vP^ has {B,C} as its left extension. 
Similarly, every prefix of has in its left extension letters A,B. It can happen that a 
word w is a prefix of both and . Then Lext(w) = {A, B,C}. However, since the 
infinite words vP' , are different, starting from a certain length of the factor w we have 
#Lext(u>) = 2. 

3.4 Rauzy graphs 

Another important tool for the study of combinatorial properties of infinite words are the 
so-called Rauzy graphs, 17]. 

Definition 3.13 Let u be an infinite word in the alphabet A and let C n be the set of its 
factors of length n, n E N. Rauzy graph T n is a directed graph whose set of vertices is C n 
and set of directed edges is C n+ \ . The edge e E C n+ \ starts at a vertex x E C n and ends 
at a vertex y E C n , if x is a prefix of e and y is its suffix, i.e. 

e = w wi ■ ■■w n -\w n 
• • 

x = wqWi ■ ■ ■ w n -i y = w\ ■ ■ -w n -iw n 

The number of edges starting at a vertex x is called the outdegree of x and denoted by 
d + (x), the number of edges ending at x is called the indegree of x and denoted by d~(x). 

From the definition of a Rauzy graph, we have 

d + (w) = #Rext(w) and d~ (w) = #Lext(w) (3.25) 

Remark 13. 1UI implies that if r n+ i contains a vertex with outdegree K, then the graph 
r n contains a vertex with outdegree > K. Similar statement holds also for indegrees. 
Therefore 

max d - (it;) > max d~(w) and max > max d + {w). (3.26) 
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Example 3.14 Let us consider the lower mechanical word 



u,, 



n+1 




n 


T 




-T- 



n E Z, 



where r 



Using Example \2. 7[ this infinite word in the alphabet {0, 1} is a coding of the C&P sequence 
S_i „((3 — 1,0\. According to Remark \3.'A it is a sturmian word, i.e. of complexity C{n 
n+1. It can be easily computed that 



£ 3 = {010,011,101,110}, 

£ 4 = {0101, 0110, 1010, 1011, 1101} , 

C 5 = {01011,01101,10101,10110,11010,11011}. 



The Rauzy graphs T%, T4 are illustrated on Figure^ 



110< 
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-• 0101 



r 3 



101/ 
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1101»- 



010 



0110 



-• 1011 



Figure 5: Rauzy graphs for the Fibonacci word. 



Let us list some of the properties of the Rauzy graph T n of the infinite word u £tV [c, c+t) 
corresponding to a C&P sequence. 



1. The graph T n is strongly connected for every n E N. It means that for every pair 
of vertices x, y of the graph, there exists a directed path starting at x ending at y. 
This is a consequence of the repetitivity of the infinite word u £jV [c,c + £), see (9) of 
Remark El 

2. For the length of the acceptance window I = 1, the infinite word u e ^[c,c + 1) is 
sturmian and thus AC(n) = 1 for every n G N. Using (|3.23|) . ()3.24j) and (|3.25|) for 
every n the graph T n contains exactly one vertex x £ C n with outdegree 2 and exactly 
one vertex y G C n with indegree 2. These vertices may or may not coincide, as we 
have seen in Example 13.141 

For I E the Theorem 13.41 implies AC(re) = 1 for sufficiently large n. Therefore 
the graphs have the same indegrees and outdegrees as in the sturmian case. 



28 



3. If £ ^ Z[e], then using Theorem 13 . 41 we have AC(n) = 2 for all n £ N. Since the lan- 
guage of the infinite word u £iJ? [c, c + £) is stable under mirror image (Proposition 13, 71) . 
relations (|3.23|) and ()3.24j) imply that in the graph T n there is either one vertex with 
indegree 3 and one with outdegree 3, or there are two vertices with indegree 2 and 
two with outdegree 2. Remark 13.121 states that a vertex with out or indegree 3 can 
occur only in a graph T n for small n. 

4. Let us denote by T n the graph created from T n by the change of the orientation of 
the edges. Then F n and T n are isomorphic graphs, i.e. there exists a bijection tt 
between the vertices of T n and r n , such that for every two vertices x, y of T n there is 
a directed edge from x to y if and only there is a directed edge in the graph F n from 
7r(x) to n(y). This property follows from Proposition 13.71 

The last mentioned property can be stated in an even stronger version, if we consider 
the densities of factors in C n+ \ as labels of the edges in the graph T n . 

Definition 3.15 If the densities of all factors of the infinite word u are well defined, every 
edge e in the Rauzy graph T n can be assigned a non-negative number, namely the density 
g e of the factor e. The resulting graph is called a weighted Rauzy graph. 

For every vertex x of the weighted Rauzy graph T n we have obviously a 'conservation 
law', 

E ^ = £ Qf- (3-27) 

edge e ending in x edge / starting in x 

With the mentioned properties we can prove that the factors in a C&P word take at 
most 5 values. 

Proposition 3.16 Let C n be the set of factors of length n of the infinite word u EtV [c,c+£), 
£ ^ Z[e]. The densities of factors in C n take at most 5 values, i.e. 

#{Qw I w £ C n } < 5. 

Proof Consider the weighted Rauzy graph T n of u £ ,n[c, c + 1). If for every vertex x of 
T n we have d + (x) = d~(x) = 1, then the relation (|3.27|) implies that the density of the 
edge e ending at x and of the edge / starting at x coincide. Since the graph is strongly 
connected, these edges are different, e^/. Denote by y the starting vertex of the edge e 
and by z the ending vertex of the edge /. From the graph T n we remove the vertex x and 
edges e, / and replace it by a new edge starting at y and ending at z. We assign the new 
edge with the weight g e = Qf. This reduction of the graph is illustrated of Figure El 

The reduction of the Rauzy graph T n is repeated until there are no vertices with both 
outdegree and indegree 1. The resulting graph is called the reduced weighted Rauzy graph 
RT n . The construction implies that also RT n is a strongly connected graph, the weights of 
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Qe Qf A , , Qe 

• «- • J —- • is reduced to • «- • 

y x z y z 

Figure 6: Reduction of the weighted Rauzy graph. 



its edges satisfy the conservation law and the set of weights of the graph RT n is the same 
as the set of weight of the graph T n . Moreover, the graph RT n created by reversing the 
direction of edg es in HF n is isomorphic to RV n . 

Using the property 3 of the Rauzy graph T n for sufficiently large n there are two 
vertices with outdegree 2, the outdegree of the remaining vertices is 1. Similarly, there are 
two vertices with indegree 2 and the indegree of other vertices is 1. It may happen that 
a vertex with outdegree 2 coincides with a vertex with indegree 2. This implies that the 
reduced Rauzy graph has 2, 3 or 4 vertices. 

Let us discuss the case that RY n has 4 vertices, i.e. the case when none of the vertices 
has in the same time indegree and outdegree 2. It can be easily derived that the reduced 
weighted Rauzy graph has one of the forms illustrated on Figure [7| Since all the possible 
reduced graphs have six edges, the original weighted Rauzy graph has at most six different 
densities. We can eliminate the sixth value in graphs G\, G2 and G3 using the conservation 
law. In the graph G\ we have q\ = Q4 + Qq = Q3- Similarly in the graph G2 we have 
Qi = Q2 + f?5 = f?3- In the graph G3 we have Qi = Q4 — Qe = Q3- 



G\ G<i 




Figure 7: Possible reduces weighted Rauzy graphs. 



The conservation law is not sufficient for reducing the number of densities in the graph 
G4. Here we use the property 4 of the Rauzy graph of a C&P, namely that by changing the 
direction of the edges in G4 we obtain an isomorphic graph G4. The graphs are illustrated 
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on Figure |H1 The only permutation tt of the vertices which realizes the isomorphism of 
the graphs G4 and G4 is the permutation ir(x) = v, 7r(y) = z, tt(z) = y, tt(v) = x. The 
isomorphism preserves the densities, thus Q2 = Qi-, = Qa- 




Figure 8: Isomorphic reduced graphs G4 and G4. 



We have thus solved the case that the reduced weighted graph RT n has 4 vertices. If 
RT n has 2 or 3 vertices, then such a graph has at most 5 edges. Therefore there are at 
most 5 values of densities. 

For small values of n it can happen that the graph r n has one vertex with outdegree 3 
and one vertex with indegree 3, the other vertices having both outdegree and indegree 1. 
In this case the reduced Rauzy graph RT n has 1 or 2 vertices and at most 4 edges, thus 
the number of different values of densities is less or equal to 4. ■ 

Remark 3.17 In case that I S Z[e], then the densities of factors of length n of the infinite 
word u EtV [c,c + I) take at most 3 values for sufficiently large n , because the resulting 
word is either sturmian or quasisturmian and the number of densities can be read from the 
corresponding reduced Rauzy graph, which has always at most three edges. Let us mention 
that the fact that the densities of factors in sturmian words take at most three values has 
been stated in in fact, it can be deduced already from I62f . 

3.5 Sturmian words 

As we have seen, sturmian words can be defined in several different equivalent ways, namely 
as 

• bidirectional infinite words with complexity C(n) = n + land irrational densities of 
letters, 

• mechanical words s a a, s aj/ g with irrational slope a, 

• codings of cut-and-project sequences u £jV (Q), where is a semi-closed interval of unit 
length, and e, rj are irrational numbers satisfying e e (—1,0), rj > 0. 



31 



There exist other equivalent definitions, for example using the number of palindromes 
of given length or using the so-called return words. For a nice overview of these definitions 
see [13 EI]. 

The arithmetical definition of mechanical words allows one to easily derive further 
combinatorial properties of sturmian words. In Example 12.71 we have shown that the 
upper mechanical word s a ^ corresponds to a cut-and-project sequence with an acceptance 
window which is closed from the left and open from the right. The upper mechanical word 
s a g corresponds to a cut-and-project sequence with acceptance interval of opposite type. 
Since S £) ^(— O) = — £ £)J? (fi) and since the language of a sturmian sequence is closed under 
reversal, for the study of the properties of the language we can limit our considerations to 
upper mechanical words s a ,/3, see 1(2.17(1 . 

Let us now prove three properties which have been used in [3] for the construction of 
aperiodic wavelets. Note that Property 13.191 can be found already in (2~T( . 

Property 3.18 The number of letters 1 in a factor of length n of the mechanical word 
Sa,/3 is equal to [na\ or \na] . 

ProofConsider a factor w of length n, w = s a: p(i)s aj p(i + 1) • • • s a: p(i + n — 1). Since 
the alphabet of the mechanical word is {0, 1}, the number of letters 1 in w is equal to 

n-1 

^Sa^ii + j) = \(i + n)a + f3] - \ia + (3] 

3=0 

= \na + ia + j3 — \ia + (J] \ 

> v ' 

e(-i,o) 



Property 3.19 All n + 1 factors of length n of the mechanical word s Qj/ g appear in the 
factor w of length 2n of the mechanical word s a ,-a; given by 

W = S a - a (-n + l)s a - a {-n + 2) • • -Sa,~a(0) • ■ -s a - a (n) . 

Proof Example l2~71 savs that s Q)( g codes the distances in the cut-and-project sequence 
f3 + 1) for arbitrary rj > 0. Since the language of a cut-and-project sequence 
does not change by translation of the acceptance interval, we can study without loss of 
generality the language of the mechanical word s a -a, i.e. of the cut-and-project sequence 
£- a ,7j[— ol, 1 — a). The stepping function has a unique discontinuity point, namely 8% = 0. 
The same considerations as for determining the complexity in Section 13.11 lead to the 
fact that the acceptance window = [— a, 1 — a) is divided by n points Si, f~ 1 (5i), 
f~ n+1 (5i) into n + 1 disjoint subintervals closed from the left and open from the 
right, say Q w (i), & w (2), £l w (n+i), with the following property: if x,y are elements of 
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£_q,„[— a, 1 — a), then the n-tuples of distances starting from x and from y coincide if 
and only if x*, y* belong to the same interval £l w (i) for some 1 < i < n + 1. Since the 
left boundary points of all the intervals belong to Z[a], these boundary points are star 
map images of points of £_ a) „[— a, 1 — a). The boundary points of the intervals t«W are 
explicitly given by: 

-a = f(S 1 ),0 = 6 1 ,r\6),--- ,r n+1 (5i). 

Therefore it suffices to consider all n-tuples of distances in £_ a „[— a,l — a), starting at 
point 0, at its right neighbour, and at its n — 1 left neighbours. From Example 12 . 71 we know 
that every element xj~ E Y,_ a ^[— a, 1 — a) has the form xj~ = \ka — a] + krj for k E Z. 
Thus xo = and we must study the n-tuples of distances between points x_ n+ i, X-n+2, 
. . . , xq, xi, . . . , a^n+l- Since s a - a (k) codes the distance between x^ and rcfc+i, the proof 
is finished. ■ 

Property 3.20 The number of factors of length n in the mechanical words a g prefixed by 
1 is equal to \not \ . 

Proof Property 13.191 implies that, for the description of the first letter of all n + 1 
different factors of length n, it suffices to focus on letters s Qi _q,(— n + 1), s a - a (— n + 2), 
. . . , s a ,-a(0)j s a) _ a (l). The number of letters 1 among them is 

l 

~s a -a{k) = \a\ — \—na\ = 1 — [~— net] = \na \ . 

k=-n+l 

m 

Let us mention an interesting consequence of Property 13.181 If w and vJ are factors 
of Sa/3 °f the same length, then the numbers of letters 1 in w and in w' differ at most by 
1. This obviously implies that also numbers of letters in uu and w' differ at most by 1. 
Infinite words with this property are called balanced. Sturmian words are balanced. On 
the other hand, every aperiodic balanced infinite word is sturmian. We have thus obtained 
another equivalent definition of sturmian words. The above implies other properties: 

• Either 00 or 11 is not a factor of a sturmian word. 

• If 00 is not a factor of s Qj/ g or s aj/ g, and if OP'O is a factor, then x = bovx = b + l, 
where b = [j^]. 

Other interesting properties of sturmian words concern substitution invariance. This is 
the topic of the following section. 

Remark 3.21 Generic cut-and-project sequences with three distances between adjacent 
points do not have explicit formula for determining the n-th letter, which exists for sturmian 
words. Therefore the study of properties analogous to that mentioned in this subsection is 
significantly more difficult [21]. 
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4 Selfsimilarity of C&P sequences 



We now turn our attention to cut-and-project sequences with self-similarity. We say that 
a set A C 1 is self-similar if there exists a factor 7 > 1 such that 

7A c A . 

An infinite bidirectional word corresponding to a self-similar C&P set may have many 
interesting properties, namely under some very general condition it is a fixed point of a 
nontrivial morphism, or it is an image of such a fixed point. These properties are studied 
in Section El 

In this section we describe the conditions on the parameters e, 77, and interval f2, under 
which the C&P set E £i?? (f2) is self-similar. For that we need to recall some basic number 
theoretical notions that will be useful also in studying the invariance of C&P sets under 
morphisms. It turns out that e and rj are different roots of one quadratic equation with 
integer coefficients. Therefore, we restrict ourselves to notions connected to quadratic 
numbers. 

For an irrational number a we denote by Q(a) the minimal number field containing Q 
and a. If a is a quadratic number, i.e. an irrational solution of a quadratic equation with 
integer coefficients, then 

Q( Q ) = {a + ba I a,b G Q} . 

The other root a' of the quadratic equation is the algebraic conjugate of a and obviously 
we have a' G Q(a). On Q(a) one defines the mapping 

x = a + ba G Q(a) 1— > x = a + ba G a G Q(a) , 

which is (the so-called Galois) automorphism on Q(a). This means that it satisfies (x+y)' = 
x' + y' and {xy)' = x'y' for all x, y G Q(a). 

A root of a monic quadratic polynomial with integer coefficients is called a quadratic 
integer. A quadratic integer 7 is a quadratic Pisot number, if 7 > 1 and its algebraic 
conjugate 7' satisfies |7 ; | < 1. The following result may be found also in |10| . 

Theorem 4.1 

1. The C&P sequence £ £ir/ (f2) is self- similar if and only if e is a quadratic number, 
r] = e' is its algebraic conjugate, and the closure O of the acceptance f2 contains the 
origin. In that case 

£ £>r ,(ft) = S e , e /(0) = E^Cfi) = {x G Z[rj\ I x' G Q} . 

2. If 7 is the self- similarity factor of "S r) /„(Q) ) then 7 is a quadratic Pisot number in 
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Proof First let us show that if e and r\ are mutually conjugated quadratic numbers and 
G Q then S e ^(fi) is self-similar. For that we have to find a self-similarity factor 7, such 
that 7X^/^(0) C S„/ jr; (J7). 

Let e, 77 be the roots of the equation Mx 2 = Kx + L for some integers K, L, M. We 
look for 7 in the form 7 = a + Mbr] for some integers a, b. Such 7 satisfies 7 G Z[iy] and 
717 = or? + M&7/ 2 = Lb + r](a + Kb) G Z[r/]. Therefore 7Z[r/] C Z[r/]. 

Since 7 6 Z[t?] C Q(fy), we determine the image of 7 under the Galois automorphism 
7' = a + Mbrj . Clearly, 77' G Z. Since 7/ is irrational, the set Z[t/] = Z + Zr/' is dense in 
M and thus there are infinitely many choices of a, 6 G Z so that 7' 6 (0, 1). Together with 
the fact G fHt follows that 7' $7 C Q. We use the above to obtain 

^ V ', v (^) = l{x G Z[rj\ I x' G Q} = {-fx £ 7 Z[r?] | G 7'fi} C 

C {7a; G Z[r?] I j'x' eS]}C{j,e Z[r/] | y G 0} = . 

Since 7' G (0, 1) and 77' G Z, we have I7I > 1. If 7 > 1, it is the desired self-similarity 
factor, in the opposite case we choose 7 2 for the self-similarity factor. 

Let us prove the necessary condition for the self-similarity of a C&P set. Let 7 > 1 
satisfy 7X^,7 C S £;J? (J7). For a chosen point x = a + br/ G 2^(17) we have 7X G 
T, £jV (U) C Z[r/]. Therefore there must exist integers a, b such that 7X = 7(0 + 677) = a + brj. 
This implies 

a + br] —a + 07 

7 = — and r/ = — . 

a + br] ' 6 — 67 

Therefore Q(7) = Q(t/). 

Let (x n ) ne z be the strictly increasing sequence such that X £)r? (f2) = {x n \ n G Z}. 
Recall that the distances between neighbouring points of Y, £ , ri {Q,) take values x n+ \ — x n G 
{Ai, A2, Ai + A2}, where Ai, A2 are positive numbers in Z[tj] linearly independent over 
Q. Take an index n such that x n +\ — x n = Ai. Since S £fl (0) is self-similar with the factor 
7, both 7x n and 7x n +i belong to £ £j}? (r2). Therefore the gap between the two points is 
filled by distances Ai, A2 and Ai + A2. It follows that the distance 7x^+1 — ^yx n is an 
integer combination of Ai, A2 with positive coefficients, 

7A1 = 7x ra+ i - 7x„ = fen Ai + fei 2 A 2 

for some non-negative integers fen, fei2- Analogously we obtain 

7A2 = fe2iAi + fe 2 2A 2 , k 2 i,k 2 2 G N . 

We denote by K the 2x2 matrix K = (fey) and write the above as 
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This means that 7 is an eigenvalue of the integer-valued 2x2 matrix K and as such is a 
root of a monic quadratic polynomial with integer coefficients. Since Q(f?) = Q(7) and rj 
is irrational, 7 is a quadratic integer. 

The eigenvector corresponding to 7 is (^)- As ^2 belong to the quadratic field 
QO?) = Q(t)j we can a PPly the Galois automorphism to obtain the other eigenvector and 
eigenvalue of the matrix K, 



A' I 7 V A' 

The Perron-Frobenius theorem for positive integer matrices implies that |7 ; | < 7. 

We now explain the relation between the Galois automorphism and the star map in the 
cut-and-project scheme. Take any pair of points x, y G S £i ^(J7), x < y. Their distance y — x 
belongs to £ ei?7 (f2) — S e r? (r2) = S efl (J7 — f2). From the definition of the C&P set, we have 
(y — x)* G f2 — $7. Now let x = 7 m x„, y = "y m x n+ i for any integer power m and for some 
n such that x n +i — x n = Ai, or x n +i — x n = A2 respectively. From the self-similarity of 
X £i?7 (S7), the points x, y belong to E £)?7 (0), and hence (7 m Ai)*, (7 m A2)* G Q — Q. Therefore 
the sequence of vectors 

is bounded with m — ► 00. In the above we have used the property of the star map 
(kx)* = kx* for x G Z[r/] and any integer k. Since the eigenvectors of the matrix K form a 
basis of M 2 , we can write 

for some real coefficients u\, 02- Substituting into (j4.28|) we derive that the sequence of 
vectors 

is bounded. Since 7 > 1, we have ai = and < 1. We can conclude that 7 is a 

quadratic Pisot number and (^j) = 02 ( A / x )- The lengths Ai, A2 belong to Z[t/] and hence 

can be written in the form Ai = a\ + 61 77, A2 = 02 + ^2^ f° r some integers 01,02,01,62- 
We have 

a\ + bi e a% + b\r( 
012 a 2 + 6 2 e a 2 + b 2 r]' ' 

which implies (0162 — a 26i)(e — rj') = 0. Since Ai, A2 are linearly independent over Q, 
we have a\b 2 — a 2 b\ 7^ and thus e = rf as the theorem claims. The star map in such 
a cut-and-project scheme coincides with the Galois automorphism on the quadratic field 
Q(r?) = Q(e)=Q(7). 

The last to be verified is that $7 contains the origin. Since 7S Ei „(0) C E e it follows 
easily that 7'^ C £1. This implies £ fi. ■ 
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From the proof of the above theorem it follows that if e, r] are mutually conjugated 
quadratic numbers and £ H, (i.e. S e>r? (fi) is self-similar) there exists infinitely many 
factors 7 such that 7X^(0) C X £j „(n). It can be shown that all of these factors are 
quadratic Pisot numbers in Z[iy]. The exact description of all self-similarity factors of a 
given C&P set is straightforward, but rather technical. For a generalisation of self-similarity 
studied on the most common example rj = r we refer to |45j . 

Finally, let us mention that first results about self-similar Delone sets with Meyer 
property (which include C&P sets) have been obtained by Meyer in [321 • He shows that 
the self-similarity factor of such sets must be a Pisot or Salem number, i.e. an algebraic 
integer > 1 with all conjugates in the unit disc. In a even more general setting, self- 
similarity of Delone sets has been studied in |34] 135] . 



5 Non-standard numeration systems and C&P sequences 



Another example of self-similar sets are sequences formed by /3-integers. We show how they 
are related to C&P sequences. For the definition of /3-integers we introduce the notion of 
/3-expansion, which has been first given by Renyi [HE]- The /^-expansions are studied from 
the arithmetical point of view for example in [591 1551 1281 116] . 

Let j3 be a real number greater than 1. For a non- negative i£lwe find a unique k 
such that (3 k <x < (5 k+l and put 



The coefficients x^, i 6 



x k : 
i < k 



0k 



x k P k - 



1 we define recursively 
Xi := IPn+i] , n := 0r i+ i - Xi . 
The described procedure is called the greedy algorithm. It ensures that 



^ ■ 



The above expression of x using an infinite series is called the /3-expansion of x. For (3 = 2 
or j3 = 10 we obtain the usual binary or decimal expansion of x. The real numbers x 
for which the coefficients x_2 ; X-3, • • • in the /3-expansion of |x| vanish, are called 
/3-integers. They form the set denoted by Z^, 



( k k 
l i=Q i=Q 



is a /^-expansion of an x > 
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The greedy algorithm implies that if x = Si=-oo x ^ % * s the /3-expansion of a number x, 
then Y2i=-oo x i-ifi l is the /3-expansion of fix. Therefore we trivially have for /3-integers 

pip C 7Lp . 

Let us mention that Z^ has also many other self-similarity factors. 

If /3 is an integer greater than 1, the set of /3-integers coincides with rational integers, 
Z/3 = Z. Drawn on the real line, the distances between adjacent points of Zg are all 1, and 
all integers > 1 are self-similarity factors of Z«. In this case the coefficients (digits) x% in a 
/3-expansion take values 0, 1, . . . , j3 — 1, and every finite sequence formed by these digits 
is a /3-expansion of some number x. 

The situation is very different if /3 ^ Z. As a consequence of the greedy algorithm, the 
digits in a /3-expansion take values 0, 1, ... , [/3] — 1. However, not all strings of these digits 
correspond to a number x as its /3-expansion. Which sequences of digits are permissible 
in /3-expansions and what are the distances between adjacent points in Z^ depends on the 
so-called Renyi development of 1. We define a mapping 

T a (x) = px - [px] , for x £ [0, 1] . 

Put U := [pTjf 1 ^)) for i = 1, 2, 3, ... . The sequence 

d {l) = ht 2 t 3 ■ ■ ■ 

is called the Renyi development of 1. 

In order to decide, whether a series ^^Lq x i& % * s a /3-expansion, we use the condition 
of Parry [S3] . 

Proposition 5.1 Let /3 > 1. Then *Y^i=§ x ifi l * s a P-^xpansion of a number x if and only 
if the word X^X^ — ^ * * * Xq %S lexicographically strictly smaller than txt^t^ ■ ■ ■ for allO < i < n. 

In [HS] it is shown that the distances between neighbouring points in the set Z^ are of 
the form 

oo 

£^f, for , = 0,1,2,.... 

k=i H 

A necessary condition in order that Z^ has only finitely many distances between neigh- 
bouring points is that the Renyi development of 1 is eventually periodic. The construction 
of dg(l) implies that 

i=i v 

If moreover dn{V) is eventually periodic, /3 is a root of a monic polynomial with integer 
coefficients. Such /3 is an algebraic integer. 
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Our aim is to describe which parameters have to be chosen in order that the sets £ £)7? (Q) 
and coincide on the positive half-axis 2 , i.e. when 

E £>n (n) n M+ = Zp n r+. (5.30) 

In ^] an example of such a relation is given together with the parameters e, 77, 0. In 
particular, the authors study the case of quadratic Pisot units. All quadratic Pisot units 
can be expressed as the positive roots of a quadratic equation 

/3 2 = m/3 + 1 , for m > 1 or /3 2 = m/3 — 1 , for m > 3 . 

It is shown that 

ZpDR^ for /3 2 = m/3 + l, m> 1, (5.31) 

Z^nMj for /3 2 = m/3- 1, m > 3. (5.32) 

The above equalities imply that the C&P sequences with given windows have two 
distances only between adjacent points. Thus from Theorem 12.51 they are geometrically 
similar to C&P sequences with unit acceptance interval, and therefore the infinite binary 
words corresponding to /3-integers are sturmian words. 

In the following proposition we prove that a quadratic Pisot unit (3 is the only example 
of a basis for which positive /3-integers coincide with the restriction of a cut-and-project 
set to its positive part. 

Proposition 5.2 Positive part of the set Z^ coincides with the positive part of a cut-and- 
project set S E) „(0) if and only if [3 is a quadratic Pisot unit. 

Proof One implication is obvious from (|5.31j) and (|5.32|) . Let us prove =X Since 
is a self-similar set, we impose the requirement of self-similarity also on the C&P sets, 
which implies that e, n are mutually conjugated quadratic numbers, i.e. e = rf and 0£ fl. 
According to Theorem 14.11 the self-similarity factor /3 is a quadratic Pisot number. Thus 
it remains to show that /? is a unit. 

All quadratic Pisot numbers can be expressed as the positive roots of a quadratic 
equation 

1 = m/3 + n, for 1 < n < m, or /3 2 = m/3 — n, for 1 < n < m — 2. 

Our considerations can thus be divided into two cases. 
2 Positive half-axis [0, +00) is denoted by Rq\ 
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1. Let B 2 = mB + n, m > n > 1. Then the conjugated root to 8 is the number E (—1,0). 
The Renyi development of 1 has the form 

dp(l) = ran 

and the distances between /3-integers are 



tt = 1 and -^i = 8 — t\ = 3 — m = — 

8 l f-f 8 l 8 



In this case a series Yli=o x ifi % with non-negative integer coefficients rcj is a /3-expansion if 
XiXi-i is strictly lexicographically smaller than mn, i.e. -< mn for all 1 < i < k, 

which means that Xi 6 {0, 1, . . . , m} and every digit x% = m in the string XkXt-i ■ ■ ■ x±xo 
is followed by a digit < n — 1. 

Since in a self-similar C&P set the star map and the Galois automorphism coincide, we 
can find a candidate for the acceptance interval f2 in order that 1)5.30(1 be satisfied. In the 
following estimations we use 8' G (—1,0). For x = ^i=o x *<^ ^ we nave 



c' = ^2 x i8' % < m + mB' 2 + m/?' 4 + 



n? 



i=0 



Similarly, 



c' = Y^ Xi8' 1 > mB' + m/3' 3 + m/3' 5 + 



i=0 

Clearly, the only candidate for $7 is the interval [ > x It is obvious that for such 

a window one inclusion of (|5.3Uf) is verified, 

Since has only two possible distances between neighbouring elements, namely 1 and 
the equality in the above inclusion is reached according to 8. of Remark l2.2l if \£l\ = A^— A' 2 , 
i.e. if 

m m/5' m n 



L — /?' 2 ' 
m/3' 



1-/3' 2 l-/?' 2 1 + /?' * /? 

Using the quadratic equation 8' 2 = mB 1 + n we obtain the condition (1 — n)8' = which 
implies n = 1. Thus /3 is a unit. 

2. Let us study the case B 2 = md — n, m — 2 > n > 1. Here the conjugated root 8' 
belongs to the interval (0, 1). The Renyi development of 1 has coefficients t\ = m — 1 and 
ti = m — n — 1 for i > 2, i.e. 

dg(l) = (m — l)(m — n — l)(m — n — 1) • • • = (m — l)(m — n — l) w . 
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In this case a /3-expansion of a number x has digits in the set {0,1,2,... ,<m — 1} and 
forbidden are the strings of digits equal or lexicographically greater than (m — l)(m — 
n — l) s (m — n) for arbitrary non-negative integer s. The distances between neighbouring 
/3-integers are 

CO CO 

y% = l and Y t j±± = (3- tl =/3-(m-l) = l- - a . 
T^P ~[ /3 l (3 

Again, using the Galois conjugation of x = J2i=o e we nnc ^ a candidate on the 
acceptance interval £1 for (|5.3Uf) . 

k ~ 

< x' = Xif3 H < m - 1 + (m - 2)(3' + (m - 2)/?' 2 + • • • = 1 + . 
i=o l ~ f 

Let us therefore set Vt = [0, 1 + pEy )- In order that a C&P set with such an acceptance 
window have only two distances between neighbours, we must have 

After manipulations we obtain (n — l)/3' = which implies n = 1. This completes the 
proof. ■ 



6 C&P sequences and substitutions 

Construction of an arbitrarily long segment of a C&P sequence directly from the defini- 
tion of Ti £ ^(Q) is numerically very demanding, since precise computation with irrational 
numbers requires a special arithmetics dependent on the form in which the irrational num- 
bers £, r] are given. For a class of self-similar C&P sequences the sequence of distances 
between adjacent points (i.e. the infinite word it £)r/ (f2)) can be generated effectively using 
substitution rules. For this purpose we introduce the following notions. 

The set A* of finite words on an alphabet A equipped with the empty word e and the 
operation of concatenation is a free monoid. A morphism on the monoid ^4* is a map 
(p : A* — » A* satisfying ip(wz) = (p(w)tp(z) for any pair of words w, z £ A*. Clearly, the 
morphism (p is determined by (p(a) for all a £ A. The action of a morphism <p can be easily 
extended to one-directional infinite words u = uqU\U2 ■ ■ ■ over A by the prescription 

tp(u) = Lp{uQUiu 2 ■■■) = <p(u )<p(ui)ip(u 2 ) ■ ■ ■ 

Let u = uqU\U2 ■ ■ ■ be an infinite word over an alphabet A and let ip be a morphism on 
A* satisfying \<p(a)\ > 1 for all a E A and |<^(uo)| > 1- We say that the word u is invariant 
under the substitution (p if u is its fixed point, i.e. <p(u) = u. 



41 



The (/p-invariance of u implies that ip(uo) has the form <p(uo) = uqu' for some non- 
empty word v! G A* and that ip n (u) = u for every n G N. The word (p n (uo) is a prefix of 
the fixed point u and its length grows to infinity with n, therefore we can formally write 
it = lirrin^oo ip n {uo). The substitution under which an infinite word u is invariant allows 
one to generate u starting from the initial letter uq repeating the rewriting rules infinitely 
many times. 

As an example of a substitution invariant C&P sequence let us recall the /3-integers, as 
presented in the previous section. Consider first Zg for (3 2 = m/3 + 1, where the distances 
are Ai = 1 and A2 = ^. Associating the letter A to the distance 1 and the letter B to 
the distance 4 we create a one-directional infinite word u in the alphabet {A, B}. It can 
be easily seen from the properties of /3-expansions and from the Parry condition that, if 
x,y are neighbours in Z^ such that y — x = 1, then, between the points (3x and f3y, there 
is m times the distance 1 followed by one distance Similarly, if x,y are neighbours in 
Z^ such that y — x = p, then the points f3x, j3y are also neighbours and have distance 1. 
The above considerations imply that the infinite word u is not changed, if every letter A 
is replaced by the finite word A m B, and every letter B is replaced by A. We say, that the 
word u corresponding to Z^ is invariant under the substitution ip given by 

ip(A) = A m B, (p{B) = A. 

Similarly we can derive that for 0- integers where (3 2 = mf3 — 1, the infinite word u 
corresponding to is invariant under the substitution 

ip(A) = A m ~ 1 B, <p{B) = A m - 2 B. 

We have presented the substitutions only for those /^-integers that correspond to C&P 
sequences. However in general, every Z^ which has a finite number of distances between 
neighbours is invariant under a non-trivial substitution |25| . 

To every substitution ip on the alphabet A = {a%, . . . , a^} one associates naturally the 
substitution matrix M G Mfc(No), where 

Mij = the number of letters a,j in the word <p(a,i) . 

If all letters cij of the alphabet A have a well defined density Qi in the infinite word u 
invariant under the substitution <p, then the vector (q\, Q2,---, Qk) is a left eigenvector of the 
matrix M. If the matrix M is primitive, i.e. it has a positive power, then the substitution is 
called primitive. A fixed point of a primitive substitution can be represented geometrically 
self-similar sequence in the following way. 
According to the Perron-Frobenius theorem, the matrix M has an up to a scalar factor 
unique positive right eigenvector (yi, y2, . . . , yk) T corresponding to the dominant eigen- 
value, say A. To the infinite word u = u^u\U2 • • • we associate the sequence (z n )neN such 
that 

z = and z n+1 - z n = y { if u n = a { . (6.33) 
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The sequence (z n ) n< =^ is self-similar, since we have 

\{z n | n G N } C {z n \n£ N } . 

6.1 Substitution invariance 

Infinite words corresponding to C&P sequences are bidirectional. Prom the property 2 of 
Remark 12, 21 it follows that, without loss of generality, we can consider only those C&P se- 
quences which have 6 fl, i.e. G E £i?7 (f2). We define a pointed bidirectional infinite word 
it £)J? (fi) = • • • u-2U-i\uoiiiU2 • • • such that uqU\U2 ■ • • corresponds to the order of distances 
between adjacent points of £ £i?? (Q) on the right of 0, and • • • U-3U-2U-1 corresponds to the 
order of distances between adjacent points of £ £iJ? (fi) on the left of 0. This word is ternary 
or binary. 

Let u = ■ ■ ■ u-.2 u -i\ u o u i u 2 • • • be a pointed bidirectional infinite word over an alphabet 
A. Let ip be a morphism on A* such that |i^(u_i)| > 1 and |<£>(^o)| > 1- We say that the 
word u is invariant under the substitution ip, if it satisfies 

U = ■■■ U- 2 U-l\u0UlU 2 ■■■ = ■■■ Lp(u^ 2 )¥{U-l)\p{uo)Lp(ui)Lp(u2) "' = tp(u) . 

In this case we formally write 

u = lim <p n (u-i) I ip n (u ) ■ 

n— >oo 

Let us mention what is known about the substitution invariance of infinite words asso- 
ciated to C&P sequences. First consider the binary words. As explained in Example 12.71 
all such words coincide with lower and upper mechanical words s^g, ~s a ,/3 f° r irrational 
a £ (0, 1) and any real (3 G [0, 1). The question about substitution invariance of mechani- 
cal words has been solved independently by different authors |641 1151 |H] . In order to state 
the result we need to introduce the notion of a Sturm number. 

Definition 6.1 A quadratic irrational number a £ (0, 1) whose algebraic conjugate a' 
satisfies a' (0, 1) is called a Sturm number. 

Let us mention that originally Sturm numbers were defined by a special form of their 
continued fraction. The characterization presented here is due to 2 . The necessary and 
sufficient condition for substitution invariance of mechanical words is given by the following 
theorem 

Theorem 6.2 The mechanical word s a g, resp. s Qj/ g, for irrational a G (0, 1) and real 
(3 G [0,1) is invariant under a substitution if and only if 

(i) a is a Sturm number, 

(ii) (3 G Q(a), 
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(Hi) a' < & < 1 - a' or 1 - a' < < a 1 . 

If we represent the substitution invariant lower mechanical word s a s geometrically, as 
described in (|6.33|) . we find that this geometrical representation coincides with the C&P 
sequence S £j£ /(/? — 1,0\, where a = —e. Similar statement is valid for the substitution 
invariant upper mechanical word s a 3. This implies that substitution invariance of a binary 
word u associated to a C&P sequence forces existence of a self-similar C&P sequence 
£ £j£ /(f2) such that u = u £t£ /(£l). 

Substitution invariance of ternary words corresponding to C&P sequences has not yet 
been solved completely. The authors however conjecture that, even in this case, substitu- 
tion invariance forces self-similarity of the corresponding C&P sequence. 

6.2 Substitutivity 

The original aim for studying substitution properties of C&P sequences was the possibility 
of symbolic generation of u £tTj (Q). For our present purpose, it is enough to consider a 
property weaker than substitution invariance, namely the substitutivity. We take the 
formulation of Durand 24 . 

Definition 6.3 We say that the infinite word u over an alphabet A is substitutive if there 
exist an infinite word v over an alphabet B and a letter projection ip : B — > A such that v 
is invariant under a substitution (p on B* and 

■ ■ ■ lp(v^ 2 )tp(v~2)\lp(vo)lp(v 1 )lfj(v 2 ) ■■■ = ■■■ U- 2 U-i\uqUiU 2 ■ ■ ■ 

If moreover ip is a primitive substitution, then the infinite word u is said to be primitive 
substitutive. 

If an infinite word u is substitutive, it can be constructed in such a way that generating 
by substitution the word v and using the projection ip allows us to obtain u. 

Using Theorem 12.51 we can without loss of generality consider the ternary words asso- 
ciated to C&P sequences S ejJ7 [c, c + £), where 

EG (-1,0), ?7>0, c<0<c + £ and max(-e, 1 + e) < £ < 1 . (6.34) 

The description of infinite words associated to C&P sequences which are substitutive 
can be derived from the paper of Adamczewski pQ. 

Theorem 6.4 Let e,n,c,£ satisfy (|6.34j) . The infinite word u e „[c, c + £) is primitive sub- 
stitutive if and only if e is a quadratic irrational number and c, £ G Q(s). 
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From a practical point of view it is important to know the procedure which, given 
a substitutive word u over a ternary alphabet {A, B,C}, allows to determine an alpha- 
bet B, a substitution invariant word v over B and a projection tp : B — > A such that 
ip(v) = u. The bidirectional pointed word v = ■ ■ ■ i>_2U-i|uo u i' y 2 is i n fact given by the 
initial letters V-i\vq and the substitution (p under which it is invariant, since we have 
V = limn-.oo ip n (v-i)\ip n (v ). 

In the rest of this section we describe the algorithm for solving this problem in case 
that the parameters satisfy besides the necessary conditions l)6.34|l , an additional condition 
that — e is a Sturm number, i.e. the algebraic conjugate e' of e satisfies e' < -1 or e' > 0. 
Using the transformations (|2,10|) and (|2.11|) we have 



and thus we can without loss of generality consider only e' > 0. As for the parameter 77, we 
know that, for fixed e,c,£ satisfying (|6.34|) . the words u £ ^[c, c + £) coincide for all choices 
of r] > 0. In case that e' > 0, it is suitable to put r\ = e' . In this case S £)£ /[c, c + £) is 
a self-similar set. This is a crucial property for proving the correctness of the algorithm 
presented below. 

Remark 6.5 The sequence S £j£ /[c, c + £) with parameters satisfying (|fi.34|) has according 
to Theorem \2.5\ three types of distances between adjacent points, namely e', 1 + e', 1 + 2e'. 
According to Theorem \4-l\ it is a self-similar set. The stepping function f on the acceptance 
interval = [c, c + £) has in this case the form 



This function is a bisection on the acceptance interval fi ; i.e. is invertible. 



Algorithm: 

Input: quadratic e G (—1,0) with e' > 0, c, £ G Q(e), such that c < < c + £, max(— e, 1 + 
e) <t< 1. 

Output: alphabet £>, letters t>_i,t>o € £>, morphism ip on B* , projection ip : B — > A. 

Step 1 Find a quadratic unit 7 £ (0, 1) such that 7Z[e] = Z[e] and its conjugate 7' > 1. It 
results in solving a Diophantine equation (more precisely the so-called Pell equation) 
which has always a solution. 




!-„(-«) 




x + 1 + e /or x £ [c, c + — 1 — e) 
x + 1 + 2e /or xG [c + ^— 1 — e, c — 
x + e for x £ [c — e,c + £) 



= : VLa , 

e) =-.n B , 
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Step 2 For xedwe define 

gi {x) = ^r ind{x \x), where ind(x) = min{i G N | f~\x) G (6.35) 

Find the minimal set S C SI such that 

{c,c + ^- 1 -e,c-e} C 5 and gj (S) C S . 

Such a set is finite, let us denote its elements by S = {co, ci, . . . , Ck}, where c = Co < 
cx < • • • < Cfc, and denote c^+i := c + £. Note that the elements of the set S divide 
the acceptance window into small subintervals 



^ = (J[ci)Ci+l) • 

i=0 

Step 3 Define the alphabet B := {0, 1, . . . , k} and to every letter i £ B associate the number 

ji = mm{j G N | fi^Ci) G 7^} 
and the word = w)q • • • Wj _i by the prescription 

w y -=m£ B if / j (7q) G [c m , c m+ i) . 

Step 4 Define the initial letters of the infinite word v = ■ ■ ■ f_2f-i|t>o u i u 2 • • • over the alpha- 
bet B as 

v = m G B if G [c m ,c m+ i), 

v_i = m £ B if S [c m ,c m+ i). 

Step 5 Define the projection ifi : ,8 — » A = {A, B, C} by 

r a if Ci g n A , 

= I B if Ci G B , 
[ C if Cj G Oc • 

Theorem 6.6 Let parameters e, i], c, £ satisfy (|6.34|) . Let moreover e be a quadratic 
irrational number, such that e' > ; and let c,££ Q(e). Then the alphabet B, letters 
V-i,Vo G B, morphism ip on B* , and projection ifj : B — > A, defined in the above algorithm, 
satisfy 

u etV [c, c + £) = tp(v) , where v = lim f n (v-i)\ip n (vo) . 
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The proof of the theorem follows the same ideas as in |46| , where the correctness of the 
algorithm for e = — - is shown. Note that the crucial point in the algorithm is to ensure 
that the set S of Step 2 is finite. 

Remark 6.7 

1. Given the infinite word u E the substitution ip is not given uniquely. Indeed, the 
ambiguity is found in the choice of the unit 7 in Step 1 of the algorithm. Note that if 7 
has required properties, then so does any power 7 J , j G N. 

2. The cardinality of the alphabet B is given by the cardinality of the set S, which 
depends on the choice 0/7. Taking a power of '7 as the unit factor may reduce the number 
of letters in the alphabet. 

3. In case that the word u £ ^{Q) is not only substitutive but is also a fixed point of a 
substitution, then suitable choice of 7 (sufficiently high power of minimal factor satisfying 
Step 1) in the algorithm yields the substitution under which u £ ^{£l) is invariant. 

Let us illustrate the algorithm for finding the substitution on an example. 

Example 6.8 Consider the C&P sequence T, EtV [c,c + £) with parameters 

e = — 77 = £' = _, c = 0, £ = -2 + 2V2 = -2 -As. 
V 2 V 2 

Such parameters clearly satisfy the assumptions of the algorithm. The distances between 
adjacent points of T l£tV [c, c + i) are A.\ = 1 + 77 = 1 + 4g, A2 = rj = -^=, and Ai + A2 = 

1 + 2rj = 1 + \f2, and therefore the explicit expression of the stepping function in our case 
is 

' x + 1 + e for x € [0, -3 - 5e) =: tt A , 
f(x) = < x + l + 2e for x € [-3 - 5e, -e) =: D,b , 
x + e for x € [— s, -2 - 4e) =: VL C ■ 
From that we derive the formula for the inverse function 



r\x) 



x — e for x G [0, — 2 — 3e) , 
x - 1 - 2e for x G [—2 — 3e, 1 + e) , 
x — 1 — e for x G [— 1 — e, —2 — 4e) . 



We &now; i/mf G S e r? [c, c + £). We can generate other elements of the C&P sequences 
on the right from using the stepping function f . The elements on the left are generated 
using f~ l . We have 

S efl [c, c + £) = {. . . , - 5 - 877, -4 - 677, -3 - 5r?, -2 - 377, -1 - 2n, -77, 

0, I + 77, 2 + 2r/, 3 + 4r/, 4 + 5n, 5 + 677, 5 + 7t?, . . . } , 
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and graphically, 

Ai Ai + A 2 Ai Ai A 2 Ai A x Ai + A 2 Ai Ai A 2 
. . . ^ . . . . 



The corresponding bidirectional infinite word u £tV [c, c + 1) is obtained by replacing Ai 
with the letter A, A2 with the letter C and Ai + A2 with the letter B, i.e. 

u £ , v [c, c + £) = ■■ ■ BABAAC\AABAAC ■ ■ ■ 

Let us now use the algorithm described above to derive the substitution generating the 
word u e ^[c,c + t). We proceed according to the steps of the algorithm. 

Step 1 Put 7 = 3 + 4e = 3 — 2\/2- It is obvious that 7 is a unit in Z[e] n (0, 1) and its 
algebraic conjugate 7' satisfies 7' = - = 3 — 4e = 3 + 2\/2 > 1. We have yet to verify 
that 7 Z[e] = Z[e]. For that it suffices to show that 75 G Z[e] and 7 ™ 1 £ G Z[e]. We 
have 

J£ = e (3 + 4e) = 2 + 3e G Z[e], 
7 ~ 1 e = e(3-4e) = -2 + 3e G Z[e] , 

where we have used that e 2 = |. 

.Step 2 We need to /inc? f/ie minimal set S <Z £1 closed under the action of g~ f , containing the 
points 

c = 0, c + i — 1 — e = — 3 — 5e, c — e = — e. 

XTras we search for values of all iterations ffy(x), j G N, of the above points. Important 
for the definition 1)6.35(1 of the function g 7 is the index of a point x, i.e. the first 
exponent i G No such that f~' l (x) belongs to the interval 

7 = (3 + 4e) [0, -2 - 4e) = [0, -14 - 20e) . 

Let us find the image under g 7 of the point x = 0. Clearly, G 7S), i/ras ind(O) = 0. 
Therefore g y (0) = ~/°(0) = 0. ^(0) = /or a// j G N. 

Let us find the image under g 7 of the point x = —e. We have 
r\-e) = r\-l - 2e) = —2 — 3e G 7O , 

which implies 

ind(-e)=2 and gJ-s) = -r 2 (-e) = (3 - 4e)(-2 - 3e) = -e . 

7 

T7ms fl4(— e) = — e /or a// j G N. 
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Let us now find the image under g 7 of the point x = — 3 — be. We have 
/-4(_ 3 _ 5£ ) = /-3 ( _ 4 _ 6e) = / -2 ( _ 5 _ 8e) = 
= / _1 (-6 - 9e) = -7 - lOe G 7ft , 

which implies 

ind(-3-5e)=4 and g 7 (-e) = ^/~ 4 (-3 - be) = 

= (3 - 4e)(-7 - lOe) = -1 - 2e . 

Therefore the set S must contain the point —l — 2e. In order to find further iterations 
of g 1 on the point —3 — be, we determine g 7 (— 1 — 2e). VKe /icwe seen that 1 — 

2e) = -2 - 3e G 7^. Thus 

ind(-l-2e) = l and gJ-1 - 2e) = - 2e) = -e . 

7 

Altogether, we obtain 

<4(0) = 0, for all j€N, 

fl4(-e), = -e, /or a// jGN, 
<7 7 (— 3 — 5e) = — 1 — 2e, 

gi/i-3 - be) = -e, for all j G N , j > 2 . 
We can conclude that S contains four elements, 

Co = c = , ci = — 1 — 2e , C2 = —3 — 5e , C3 = — e , 
and we pui C4 = —2 — 4e. 

ATofe i/tai the elements of the set S divide the acceptance window into small subin- 
tervals 

3 

^ = lJ[Ci,Ci+l) = 
i=0 

= [0, -1 - 2e) U [-1 - 2e, -3 - 5e) U [-3 - be, -e) U [-e, -2 - 4e), 
as it is illustrated on the following figure. 

{ 1 1 " - 1 ^^) 

C Cl C 2 C 3 C 4 
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Step 3 Since S has four elements, we have the alphabet B := {0, 1,2,3} on four letters. In 
order to define the substitution we compute the iterations /• j (7q) and we stop when 
/■ ? (7Cj) G jil. First take i = 0. We have 7C0 = and the iterations 



/°(0) = G 
/i(0) = l + e G 
/ 2 (0) = 2 + 2e G 
/3( ) = 3 + 4e G 
/4( ) = 4 + 5e G 
/5(0) = 5 + 6e G 
/6(0) = 5 + 7e G 

iVoie i/iat i/ie word </?(0) = 002013 is formed by the indices m of the left- end-points 
of the intervals [c m ,c m+ i), read in the column. Similarly for i = 1, we have 7C1 = 
7(-l - 2e) = -7- lOe. Thus 



/°(-7- 


lOe) = 


-7- 


- lOe 


G 


[co,ci), 




f\-7- 


10e) = 


-6 


-9e 


G 


[co,ci), 




f 2 (-7~ 


10e) = 


-5 


-8e 


G 


[C2,C 3 ), 


hence j\ = 5 and 




10e) = 


-4 


-6e 


G 


[co,ci), 


99(1) = 00202 


fH-7- 


10e) = 


-3 


- 5e 


G 


[C2,C 3 ), 




/ 5 (-7- 


10e) = 


-2 


-3e 


G 


7 n, 





[co,ci), 
[co,ci), 

[C2,C 3 ), 

[co,ci), 
[ci,c 2 ), 

[C3,C 4 ), 

7 n, 



i/ms jo = 6 and 

<p(0) = 002013. 



For i = 2, we have 7C2 = 7 (— 3 — 5e) = —19 — 27e. Therefore 

/°(-19-27e) = -19-27e G 

/ 1 (-19-27e) = -18-26e G 

/ 2 (-19-27e) = -17-25e G 

/ 3 (-19-27e) = -16-23e G 

/ 4 (-19-27e) = -15-22e G 

/ 5 (-19-27e) = -14-20e G 

/ 6 (-19-27e) = -13-19e G 

/ 7 (-19-27e) = -12-18e G 

/ 8 (-19-27e) = -12 -lie G 

Last, for i = 3, we have 7C3 = 7 (— e) = —2 — 3e. Thus 

/°(-2-3e) = -2-3e G [c ,ci), 

/ 1 (-2-3e) = -l-2e G [ci,c 2 ), ft ence j 3 = 3 and 

/ 2 (-2-3e) = —s G [c 3 ,c 4 ), 99(3) = 013. 

/ 3 (-2-3e) = G 7 , 



[co,ci), 
[co,ci), 

[C2,C 3 ), 

[co,ci), 

[C2,C 3 ), 

[co,ci), 
[ci,c 2 ), 
[c 3 ,c 4 ), 

7 n, 



hence j 2 = 8 and 
99(2) = 00202013. 



50 



Altogether, we have the substitution 



(p(0) = 002013, 

<p(l) = 00202, 

(p(2) = 00202013, 

ip(3) = 013. 



Step 4 Since € [co,ci) and / _1 (0) = — e £ [03,04), we pui as £/ie initial letters vq = 0, 
v-i = 3. iVote i/ia£ is a prefix of <p(0) and 3 is a suffix of tp(3), thus the word 
v = linin^oo ip n (v-\)\(p n (vo) = linin^oo (p n (3)\(p n (0) is well defined. 

Step 5 Since cq,c\ C VLa, C2 C Qb, C3 C Q.c, we have the projection ip '■ {0, 1,2,3} — ► A = 
{A,B,C}by 

4(0) = ^(1)= A, 4(2) = B, V(3) = C. 

Lei ms write the subsequent iterations of the substitution ip on the pair of initial letters 
3|0, i.e. <p n (3)\p n (0). We have for n = 0,1,2, 

3 I 
013(002013 

00201300202013 0020130020130020201300201300202013 



Since each row is a factor of the next one, in the limit we obtain the infinite word 

v = --- 00201300202013|0020130020130020201300201300202013 
Now we apply the letter projection ip, which collapses the letters and 1. We have 
v=---1300202013|0020130020130020201300--- 

ue, v [c, c+£) = ■■■ ACAABABAAC\AABAA ■ ■ 



Let us study the second iteration of the above substitution (p. We obtain 





= v?(002013) = 




= 0020130020130020201300201300202013 , 


v? 2 (i) 


= (^(00202) = 




= 0020130020130020201300201300202013 , 




= (^(00202013) = 




= 002013002013002020130020130020201300201300202013 


V? 2 (3) 


= (^(013) = 




= 00201300202013 . 
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Note that y 2 (0) = </> 2 (l). Therefore we can consider the letters and 1 as identic. It 
enables us to define a new substitution (p : {A, B,C}* — ► {A, B,C}* by 

ip (A) = AABAACAABAACAABABAACAABAACAABABAAC , 
<p(B) = AAB AAC AAB AAC AAB AB AAC AAB AAC AAB AB AAC o 

o AAB AAC AAB ABA AC , 
<p{C) = AAB AAC 'AAB AB AAC , 

under which the word u £;V [c, c + £) is invariant, we namely have 

u £ Jc,c + £) = lim (p n (C)\cp n (A). 

' n— >oo 

Note that the symbol o in the formula for the substitution stands for concatenation. 

7 Conclusions 

In this paper, we have attempted to give a unifying view of the one-dimensional cut-and- 
project point sets obtained from the square lattice in the plane. A part of the work is a 
review of former results which had to be recalled for the sake of clarity. Let us now indicate 
some possible continuations or applications of our results. 

First, it would be interesting to extend the work [3] to the case of splines of larger regu- 
larity for other cut-and-project sets, with quadratic self-similarity, or without self-similarity 
at all, especially having in view the relation between scaling equations and substitution 
properties of the considered discretizations of M. 

Mathematical diffraction of such aperiodic sets obtained by cut and projection should 
be also envisaged in a systematic way, in relation with the existence of those multiresolu- 
tion analysis and related wavelets. Concerning diffraction, one can find in the literature 
on quasicrystals many works devoted to this important subject, in which substitutional 
properties, or self-similarity, or cut and projection from higher-dimensional lattices, play a 
central role in the elaboration of rigorous results (see for instance \17\ 1321 142j and [HZj for 
a recent review on these questions). The analysis of diffraction spectra by using adapted 
wavelets, i.e. wavelets "living" on the diffracting aperiodic structure, is a project which 
remains to be really developed. 

Another nice application of the results of this paper can be envisaged in the construc- 
tion of a new type of pseudo-random number generators. First step in this direction has 
been made in where the authors use sturmian sequences to combine classical periodic 
pseudo-random sequences to produce an aperiodic pseudo-random sequence. These aperi- 
odic pseudo-random number generators (APRNG) have been tested using the DIEHARD 
test suite and using the Maurer test and it turned out that statistical properties of these 
APRNG's are significantly better than of the original periodic sequences. Moreover, the 
authors prove that the APRNG passes the spectral test. It would be very interesting 
to pursue the study of the APRNG's extending the definition to generic (i.e. ternary) 
cut-and-project sequences. 
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